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(57) Abstract: The present invention concerns fusion of 
Fc domains with biologically active peptides and a process 
for preparing pharmaceutical agents using biologically 
active peptides. In this invention, pharmacologically active 
compounds are prepared by a process comprising: a) selecting 
at least one peptide that modulates the activity of a protein of 
interest; and b) preparing a pharmacologic agent comprising 
an Fc domain covalently linked to at least one amino acid of 
the selected peptide. Linkage to the vehicle increases the 
half-life of the peptide, which otherwise would be quickly 
degraded in vivo. The preferred vehicle is an Fc domain. The 
peptide can be selected, for example, by phage display, E.coli 
display, ribosome display, RNA -peptide screening, yeast-based 
screening, chemical -peptide screening, rational design, or 
protein structural analysis. 
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Modified Peptides as Therapeutic Agents 
Background of the Invention 

Recombinant proteins are an emerging class of therapeutic agents. 
5 Such recombinant therapeutics have engendered advances in protein 
formulation and chemical modification. Such modifications can protect 
therapeutic proteins, primarily by blocking their exposure to proteolytic 
enzymes. Protein modifications may also increase the therapeutic 
protein's stability, circulation time, and biological activity. A review 

1 0 article describing protein modification and fusion proteins is Francis 
(1992), Focus on Growth Factors 3:4-10 (Mediscript, London), which is 
hereby incorporated by reference. 

One useful modification is combination with the "Fc" domain of an 
antibody. Antibodies comprise two functionally independent parts, a 

15 variable domain known as "Fab", which binds antigen, and a constant 
domain known as "Fc", which links to such effector functions as 
complement activation and attack by phagocytic cells. An Fc has a long 
serum half-life, whereas an Fab is short-lived. Capon et al. (1989), Nature 
337: 525-31. When constructed together with a therapeutic protein, an Fc 

2 0 domain can provide longer half-life or incorporate such functions as Fc 
receptor binding, protein A binding, complement fixation and perhaps 
even placental transfer. Id. Table 1 summarizes use of Fc fusions known in 
the art. 
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Table 1 — Fc fusion with therapeutic proteins 



Form of Fc 


Fusion 


Therapeutic 






partner 


implications 


Reference 


IgGi 


N-terminus of 
CD30-L 


Hodgkin's disease; 
anaplastic lymphoma; T- 
cell leukemia 


U.S. Patent No. 
5,480,981 


Murine Fcy2a 


IL-10 


anti-inflammatory; 
transplant rejection 


Zheng et al. (1995), J. 
Immunol. 154: 5590-600 


lgG1 


TNF receptor 


septic shock 


Fisher et al. (1996), N. 
Enal. J. Med. 334: 1697- 
1702; Van Zee, K. et al. 
(1996). J. Immunol. 156: 

ooo-i on 


IgG, IgA, 
igM, or ign 
(excluding 
the first 
domain) 


TNF receptor 


inflammation, autoimmune 
aisoraers 


U.S. Pat. No. 5,808,029, 

i C O 1 t r\r\ C q r»to YY\ Kor i f-\ 

lobUcU Ooplfc;! 1 lUfcJI IO, 

1998 


lgG1 


CD4 receptor 


AIDS 


Capon et al. (1989), 
Nature 337: 525-31 


lgG1, 
lgG3 


N-terminus 
of IL-2 


anti-cancer, antiviral 


Harvill et al. (1995), 
Immunotech. 1: 95-105 


IgGi 


C-terminus of 
OPG 


osteoarthritis; 
bone density 


WO 97/23614, published 
July 3, 1997 


IgGi 


N-terminus of 
leptin 


anti-obesity 


PCT/US 97/23183, filed 
December 11, 1997 


Human Ig 
Cy1 


CTLA-4 


autoimmune disorders 


Linslev (1991), J. Exp. 
Med. 174:561-9 



A much different approach to development of therapeutic agents is 
peptide library screening. The interaction of a protein ligand with its 
5 receptor often takes place at a relatively large interface. However, as 

demonstrated for human growth hormone and its receptor, only a few key 
residues at the interface contribute to most of the binding energy. 
Clackson et al . (1995), Science 267: 383-6. The bulk of the protein ligand 
merely displays the binding epitopes in the right topology or serves 
10 functions unrelated to binding. Thus, molecules of only "peptide" length 
(2 to 40 amino acids) can bind to the receptor protein of a given large 
protein ligand. Such peptides may mimic the bioactivity of the large 
protein ligand ("peptide agonists") or, through competitive binding, 
inhibit the bioactivity of the large protein ligand ("peptide antagonists"). 
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Phage display peptide libraries have emerged as a powerful 
method in identifying such peptide agonists and antagonists. See, for 
example, Scott etal. (1990), Science 249: 386; Devlin etal. (1990), Science 
249: 404; U.S. Pat. No. 5,223,409, issued June 29, 1993; U.S. Pat. No. 
5 5,733,731, issued March 31, 1998; U.S. Pat. No. 5,498,530, issued March 12, 
1996; U.S. Pat. No. 5,432,018, issued July 11, 1995; U.S. Pat. No. 5,338,665, 
issued August 16, 1994; U.S. Pat. No. 5,922,545, issued July 13, 1999; WO 
96/40987, published December 19, 1996; and WO 98/15833, published 
April 16, 1998 (each of which is incorporated by reference). In such 

1 0 libraries, random peptide sequences are displayed by fusion with coat 
proteins of filamentous phage. Typically, the displayed peptides are 
affinity-eluted against an antibody-immobilized extracellular domain of a 
receptor. The retained phages may be enriched by successive rounds of 
affinity purification and repropagation. The best binding peptides may be 

15 sequenced to identify key residues within one or more structurally related 
families of peptides. See, e.g., Cwirla et ai (1997), Science 276: 1696-9, in 
which two distinct families were identified. The peptide sequences may 
also suggest which residues may be safely replaced by alanine scanning or 
by mutagenesis at the DNA level. Mutagenesis libraries may be created 

2 0 and screened to further optimize the sequence of the best binders. 
Lowman (1997), Ann. Rev. Biophys. Biomol. Struct. 26: 401-24. 

Other methods compete with phage display in peptide research. A 
peptide library can be fused to the carboxyl terminus of the lac repressor 
and expressed in E. coli . Another E. coli -based method allows display on 

2 5 the cell's outer membrane by fusion with a peptidoglycan-associated 

lipoprotein (PAL). Hereinafter, these and related methods are collectively 
referred to as "E. coli display." Another biological approach to screening 
soluble peptide mixtures uses yeast for expression and secretion. See 
Smith etal. (1993), Mol. Pharmacol. 43: 741-8. Hereinafter, the method of 
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Smith et al . and related methods are referred to as "yeast-based screening." 
In another method, translation of random RNA is halted prior to ribosome 
release, resulting in a library of polypeptides with their associated RNA 
still attached. Hereinafter, this and related methods are collectively 
5 referred to as "ribosome display." Other methods employ chemical linkage 
of peptides to RNA; see, for example, Roberts & Szostak (1997), Proc. Natl. 
Acad. ScL USA, 94: 12297-303. Hereinafter, this and related methods are 
collectively referred to as "RNA-peptide screening." Chemically derived 
peptide libraries have been developed in which peptides are immobilized 

10 on stable, non-biological materials, such as polyethylene rods or solvent- 
permeable resins. Another chemically derived peptide library uses 
photolithography to scan peptides immobilized on glass slides. 
Hereinafter, these and related methods are collectively referred to as 
"chemical-peptide screening." Chemical-peptide screening may be 

1 5 advantageous in that it allows use of D-amino acids and other unnatural 
analogues, as well as non-peptide elements. Both biological and chemical 
methods are reviewed in Wells & Lowman (1992), Curr. Opin. Biotechnol. 
3: 355-62. 

In the case of known bioactive peptides, rational design of peptide 
2 0 ligands with favorable therapeutic properties can be completed. In such 
an approach, one makes stepwise changes to a peptide sequence and 
determines the effect of the substitution upon bioactivity or a predictive 
biophysical property of the peptide (e.g., solution structure). Hereinafter, 
these techniques are collectively referred to as "rational design." In one 
2 5 such technique, one makes a series of peptides in which one replaces a 

single residue at a time with alanine. This technique is commonly referred 
to as an "alanine walk" or an "alanine scan." When two residues 
(contiguous or spaced apart) are replaced, it is referred to as a "double 
alanine walk." The resultant amino acid substitutions can be used alone or 
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in combination to result in a new peptide entity with favorable therapeutic 
properties. 

Structural analysis of protein-protein interaction may also be used 
to suggest peptides that mimic the binding activity of large protein 
5 ligands. In such an analysis, the crystal structure may suggest the identity 
and relative orientation of critical residues of the large protein ligand, 
from which a peptide may be designed. See, e.g., Takasaki et al. (1997), 
Nature Biotech. 15: 1266-70. Hereinafter, these and related methods are 
referred to as "protein structural analysis." These analytical methods may 

1 0 also be used to investigate the interaction between a receptor protein and 
peptides selected by phage display, which may suggest further 
modification of the peptides to increase binding affinity. 

Conceptually, one may discover peptide mimetics of any protein 
using phage display and the other methods mentioned above. These 

15 methods have been used for epitope mapping, for identification of critical 
amino acids in protein-protein interactions, and as leads for the discovery 
of new therapeutic agents. E.g., Cortese et aL (1996), Curr. Opin. Biotech. 7: 
616-21. Peptide libraries are now being used most often in immunological 
studies, such as epitope mapping. Kreeger (1996), The Scientist 10(13): 19- 

2 0 20. 

Of particular interest here is use of peptide libraries and other 
techniques in the discovery of pharmacologically active peptides. A 
number of such peptides identified in the art are summarized in Table 2. 
The peptides are described in the listed publications, each of which is 
2 5 hereby incorporated by reference. The pharmacologic activity of the 

peptides is described, and in many instances is followed by a shorthand 
term therefor in parentheses. Some of these peptides have been modified 
(e.g., to form C-terminally cross-linked dimers). Typically, peptide 
libraries were screened for binding to a receptor for a pharmacologically 
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active protein (e.g., EPO receptor). In at least one instance (CTLA4), the 
peptide library was screened for binding to a monclonal antibody. 



Table 2 — Pharmacologically active peptides 



Form of 


Binding 
partner/ 


Pharmacologic 


Reference 


peptide 


protein of 
interest 


activity 




intrapeptide 
disulfide- 
Donaeu 


EPO receptor 


EPO-mimetic 


Wrighton et al. (1996), 
Science 273: 458-63: 

issued June 30, 1998 to 
Wrighton et al. 


C-terminally 
cross-linked 
dimer 


EPO receptor 


EPO-mimetic 


Livnah et al. (1996), 
Science 273: 464-71 : 
Wrighton et ai. (1997), 
i Nature DioieC/nnoiuyy io. 
1261-5; International 
patent application WO 

Dec. 19, 1996 


linear 


EPO receptor 


EPO-mimetic 


Naranda et al. (1999), 
Proc. Natl. Acad. Sci. 
USA, 96: 7569-74; WO 

September 23, 1999 


linear 


c-MpI 


TPO-mimetic 


Cwirla et al.(1997) 
Science 276: 1696-9: 
U.S. Pat. No. 5,869,451, 
issued Feb. 9, 1999; U.S. 
Pat. No. 5,932,946, 
issued Aug. 3, 1999 


O -t q i* m i n a I lv 
\_/ ici 1 1 in icuiy 

cross-linked 
dimer 




TPO-mimptir 1 

1 I W 1 1 III 1 IC? LI W 


V_/ V V Hid t? L dl. v \ yJ^J t J i 

Science 276: 1696-9 


disulfide- 
linked dimer 




stimulation of 
hematopoiesis 
("G-CSF-mimetic") 


Paukovits et al. (1984), 
Hoppe-Sevlers Z. 
Phvsiol. Chem. 365: 303- 
11; Laerum et al. (1988), 
Exp. Hemat. 16: 274-80 


alkylene- 
linked dimer 




G-CSF-mimetic 


Bhatnagar et al. (1996), 
J. Med. Chem. 39: 3814- 
9; Cuthbertson et al. 
11997V J. Med. Chem. 
40: 2876-82; King et al. 
M991). Exp. Hematol. 
19:481; King et al. 
M995V Blood 86 (Suppl. 



a The protein listed in this column may be bound by the associated peptide (e.g., EPO 
receptor, IL-1 receptor) or mimicked by the associated peptide. The references listed for 
each clarify whether the molecule is bound by or mimicked by the peptides. 
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1):309a 



linear 


IL-1 receptor 


inflammatory and 
autoimmune diseases 
("IL-1 antagonist" or 
"IL-1ra-mimetic") 


U.S. Pat. No. 5,608,035; 
U.S. Pat. No. 5,786,331; 
U.S. Pat. No. 5,880,096; 
Yanofsky et al. (1996), 
Proc. Natl. Acad. Sci. 93: 
7381-6; Akeson et al. 
(1996), J. Biol. Chem. 
271: 30517-23; 
Wiekzorek et al. (1997), 
Pol. J. Pharmacol. 49: 
107-17; Yanofsky (1996), 
PNAs, 93:7381-7386. 


linear 


Facteur 
thymique 
serique (FTS) 


stimulation of lymphocytes 
("FTS-mimetic") 


Inagaki-Ohara et al. 
(1996), Cellular Immunol. 
171:30-40; Yoshida 
(1984), Int. J. 
Immunooharmacol. 
6:141-6. 


intrapeptide 
disulfide 
bonded 


CTLA4 MAb 


CTLA4-mimetic 


Fukumoto et al. (1998), 
Nature Biotech. 16: 267- 
70 


exocyclic 


TNF-a receptor 


TNF-a antagonist 


Takasaki et aL (1 997), 
Nature Biotech. 15:1266- 
70; WO 98/53842, 
published December 3, 
1998 


linear 


TNF-a receptor 


TNF-a antagonist 


Chirinos-Rojas ( ), J. 
I mm., 5621-5626. 


intrapeptide 
disulfide 
bonded 


C3b 


inhibition of complement 
activation; autoimmune 
diseases 
("C3b-antagonist") 


Sahu et al. (1996), J. 
Immunol. 157: 884-91: 
Morikis et al. (1998), 
Protein Sci. 7: 61 9-27 


linear 


vinculin 


cell adhesion processes — 
cell growth, differentiation, 
wound healing, tumor 
metastasis ("vinculin 
binding") 


Adey et al. (1997), 
Biochem. J. 324: 523-8 


linear 


C4 binding 
protein (C4BP) 


antithrombotic 


Linse et al. (1997), J. 
Biol. Chem. 272: 14658- 
65 


linear 


urokinase 
receptor 


processes associated with 
urokinase interaction with 

its receptor (e.g., 
angiogenesis, tumor cell 
invasion and metastasis); 
("UKR antagonist") 


Goodson et aL (1994), 
Proc. Natl. Acad. Sci. 91: 
7129-33; Internationa! 
application WO 
97/35969, published 
October 2, 1997 


linear 


Mdm2, Hdm2 


- Inhibition of inactivation of 
p53 mediated by Mdm2 or 
hdm2; anti-tumor 
("Mdm/hdm antagonist") 


Picksfey et al. (1994), 
Oncoaene 9: 2523-9; 
Bottger et al. (1997) J. 
Mol. Biol. 269: 744-56; 
Bottaer et al. (1996), 



b FTS is a thymic hormone mimicked by the molecule of this invention rather than a 
receptor bound by the molecule of this invention. 
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Oncogene 13: 2141-7 



linear 


p21 WAF1 


anti-tumor by mimicking 
the activity of p21 WAF1 


Bail et al. (1997). Curr. 
Biol. 7: 71-80 


linear 


farnesyl 
transferase 


anti-cancer by preventing 
activation of ras oncogene 


Gibbs et al. (1994), Cell 
77:175-178 


linear 


Ras effector 
domain 


anti-cancer by inhibiting 
biological function of the 
ras oncogene 


Moodie et al. (1994), 
Trends Genet 1 0: 44-48 
Rodriguez et ai. (1994), 
Nature 370:527-532 


linear 


SH2/SH3 
domains 


anti-cancer by inhibiting 
tumor growth with 
activated tyrosine kinases; 
treatment of SH3- 

mediated disease states 
("SH3 antagonist") 


Pawson et a! (1993), 
Curr. Biol. 3:434-432 
Yu et al. (1994), Cell 
76:933-945; Rickies et al. 
(1994). EMBO J. 13: 
5598-5604; Sparks et al. 
(1994). J. Biol. Chem. 
269: 23853-6; Sparks et 
al. (1996), Proc. Natl. 
Acad. Sci. 93: 1540-4: 
US Pat. No. 5,886,150, 
issued March 23, 1999; 
US Pat. No. 5,888,763, 
issued March 30, 1999 


linear 


P 16 ,NK4 


anti-cancer by mimicking 
activity of p16; e.g., 
inhibiting cyclin D-Cdk 
complex ( p16-mimetic ) 


F^hraeus etal. (1996), 
Curr. Biol. 6:84-91 


linear 


Src, Lyn 


inhibition of Mast cell 
activation, IgE-related 

conditions, type I 
hypersensitivity ("Mast 

cell antagonist") 


Stauffer et al. (1997), 
Biochem. 36: 9388-94 


linear 


Mast cell 
protease 


treatment of inflammatory 
disorders mediated by 
release of tryptase-6 
("Mast cell protease 
inhibitors") 


International application 
WO 98/33812, published 
August 6, 1998 


linear 


HBV core 
antigen (HBcAg) 


treatment of HBV viral 
infections ("anti-HBV") 


Dyson & Muray (1995), 
Proc. Natl. Acad. Sci. 92: 
2194-8 


linear 


selectins 


neutrophil adhesion; 
inflammatory diseases 
("seiectin antagonist") 


Martens et al. (1995), J. 
Biol. Chem. 270: 21129- 
36; European patent 
application EP 0 714 
912, published June 5, 
1996 


linear, 
cyclized 


calmodulin 


calmodulin antagonist 


Pierce et al. (1995), 
Molec. Diversity 1 : 259- 
65; Dedman et al. 
(1993). J. Biol. Chem. 
268: 23025-30; Adey & 
Kay (1996), Gene 169: 
133-4 


linear, 
cyclized- 


integrins 


tumor-homing; treatment 
for conditions related to 


International applications 
WO 95/14714, published 
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integrin-mediated cellular 


June 1, 1995; WO 


events, including platelet 


97/08203, published 


aggregation, thrombosis, 


March 6, 1997; WO 


wound healing, 


98/1 0795, published 


osteoporosis, tissue 


March 19, 1998; WO 


repair, angiogenesis (e.g., 


99/24462, published May 


for treatment of cancer), 


20, 1999; Kraft et al. 


and tumor invasion 


(1999), J. Biol. Chem. 


("integrin-binding") 


274: 1979-1985 



cyclic, linear 


fibronectin and 
extracellular 
matrix 
components of T 
cells and 
macrophages 


treatment of inflammatory 
and autoimmune 
conditions 


WO 98/09985, published 
March 12, 1998 


linear 


somatostatin 
and cortistatin 


treatment or prevention of 
hormone-producing 
tumors, acromegaly, 
giantism, dementia, 
gastric ulcer, tumor 
growth, inhibition of 
hormone secretion, 
i nuuuicuion ot sieep or 
neural activity 


European patent 
application 0 91 1 393, 
published April 28, 1999 


linear 


uacienai 
Iipopolysac- 
charide 


antiDiotic, sepxic snocK^ 
disorders modulatable by 
CAP37 


u.o. Kat. IMO. o,o/ / , lot, 
issued March 2, 1999 


linear or 
cyclic, 

II IU1UUII iy U" 

amino acids 


pardaxin, mellitin 


antipathogenic 


WO 97/31019, published 
28 August 1997 


linear, cyclic 


VIP 


impotence, 
neurodegenerative 
uisoruers 


WO 97/40070, published 
October 30, 1997 


III IfcJcU 


v_/ 1 l_S 


cancer 


tr u / /u o^i4, puDiisneo 
Mav 2 1 997 


linear 


THF-gamma2 




Burnstein (1988), 
Biochem., 27:4066-71. 


linear 


Amyiin 




Cooper (1987). Proc. 
Natl. Acad. Sci.. 
84:8628-32. 


linear 


Adrenomedullin 




Kitamura (1993). BBRC. 
192:553-60. 


cyclic, linear 


VEGF 


anti-angiogenic; cancer, 
rheumatoid arthritis, 
diabetic retinopathy, 
psoriasis ("VEGF 
antagonist") 


Fairbrother (1998), 
Biochem., 37:17754- 
17764. 


cyclic 


MMP 


inflammation and 
autoimmune disorders; 
tumor growth 
("MMP inhibitor") 


Koivunen (1999), Nature 
Biotech.. 17:768-774. 




HGH fragment 


treatment of obesity 


U.S. Pat. No. 5,869,452 




Echistatin 


inhibition of platelet 


Gan (1988), J. Biol. 
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aggregation 


Chem.. 263:19827-32. 


linear 


SLE 
autoantibody 


SLE 


WO 96/30057, published 
October 3, 1 996 




GD1 alpha 


suppression of tumor 
metastasis 


Ishikawa et at. (1998), 
FEBS Lett. 441 (1): 20-4 




antiphospholipid 
beta-2- 
glycoprotein-l 
(P2GPI) 
antibodies 


endothelial cell activation , 
antiphospholipid 
syndrome (APS), 
thromboembolic 
phenomena, 
thrombocytopenia, and 
recurrent feta! loss 


Blank et al. (1999). Proc. 
Natl. Acad. Sci. USA 96: 
5164-8 


linear 


T Cell Receptor 
beta chain 


diabetes 


WO 96/11214, published 
April 18, 1996. 






Antiproliferative, antiviral 


WO 00/01402, published 
January 13, 2000. 






anti-ischemic, growth 
hormone-liberating 


WO 99/62539, published 
December 9, 1999. 






anti-angiogenic 


WO 99/61476, published 
December 2, 1 999. 


linear 




Apoptosis agonist; 
treatment of T cell- 
associated disorders (e.g., 
autoimmune diseases, 
viral infection, T cell 
leukemia, T cell 
lymphoma) 


WO 99/38526, published 
Aug. 5, 1999. 


linear 


MHC class II 


treatment of autoimmune 
diseases 


US Pat. No. 5,880,103, 
issued March 9, 1999. 


linear 


androgen R, 
p75, MJD, DCC, 
huntingtin 


proapoptotic, useful in 
treating cancer 


WO 99/45944, published 
September 16, 1999. 


linear 


von Willebrand 
Factor; Factor 
VIII 


inhibition of Factor VIII 
interaction; anticoagulants 


WO 97/41220, published 
April 29, 1997. 


linear 


lentivirus LLP1 


antimicrobial 


US Pat. No. 5,945,507, 
issued Aug. 31, 1999. 


linear 


Delta-Sleep 
Inducing Peptide 


sleep disorders 


Graf (1986), Peptides 
7:1165. 


linear 


C- Reactive 
Protein (CRP) 


inflammation and cancer 


Barna (1994). Cancer 
Immunol. Immunother. 
38:38 (1994). 


linear 


Sperm- 
Activating 

D K>+ ! f~\ CS o 

nepnues 


infertility 


Suzuki (1992). Comp. 
Biochem. Phvsiol. 


linear 


angiotensins 


hematopoietic factors for 

hematocytopenic . 
conditions from cancer, 
AIDS, etc. 


Lundergan (1999), J, 
Periodontal Res. 
34(4):223-228. 


linear 


HIV-1 gp41 


anti-AlDS 


Chan (1998), Ceil 
93:681-684. 


linear 


PKC 


inhibition of bone 
resorption 


Moonga (1998), Exp. 
Phvsiol. 83:717-725. 


linear 


defensins (HNP- 


antimicrobial 


Harvig (1994V Methods 
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1,-2, -3, -4) 




Enz. 236:160-172. 


Ml 1C7CU 


erbB-2 


AH NP-mi motto "s»nti-ti imnr 
nniNr 1 1 in i iciUj.ai m iui I iui 


Park (90C)n\ Wat 

Biotechnol. 18:194-198. 


linear 


gp130 


IL-6 antagonist 


WO 99/60013, published 
Nov. 25, 1.999. 


ill leal 


uuiidymi, uu it?i 

joint, cartilage, 
arthritis-related 
proteins 


clULUII I II I lUl It? UlotfdotJo 


WO niihlichorl 
vvu yy/outOt, {juuiibi tcu 

Oct. 7, 1999. 


linear 


HIV-1 envelope 
protein 


treatment of neurological 
degenerative diseases 


WO 99/51254, published 
Oct. 14, 1999. 


linear 


IL-2 


autoimmune disorders 
(e.g., graft rejection, 
rheumatoid arthritis) 


WO 00/04048, published 
Jan. 27, 2000; WO 
00/11028, published 
March 2, 2000. 



Peptides identified by peptide library screening have been regarded 
as "leads" in development of therapeutic agents rather than as therapeutic 
agents themselves. Like other proteins and peptides, they would be 
5 rapidly removed in vivo either by renal filtration, cellular clearance 

mechanisms in the reticuloendothelial system, or proteolytic degradation. 
Francis (1992), Focus on Growth Factors 3: 4-11. As a result, the art 
presently uses the identified peptides to validate drug targets or as 
scaffolds for design of organic compounds that might not have been as 
1 0 easily or as quickly identified through chemical library screening. 

Lowman (1997), Ann. Rev. Biophys. Biomol. Struct. 26: 401-24; Kay etal. 
(1998), Drug Disc. Today 3: 370-8. The art would benefit from a process by 
which such peptides could more readily yield therapeutic agents. 

Summary of the Invention 
1 5 The present invention concerns a process by which the in vivo half- 

life of one or more biologically active peptides is increased by fusion with 
a vehicle. In this invention, pharmacologically active compounds are 
prepared by a process comprising: 

a) selecting at least one peptide that modulates the activity of a 
2 0 protein of interest; and 
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b) preparing a pharmacologic agent comprising at least one 

vehicle covalently linked to at least one amino acid sequence 
of the selected peptide. 
The preferred vehicle is an Fc domain. The peptides screened in step (a) 
5 are preferably expressed in a phage display library. The vehicle and the 
peptide may be linked through the N- or C-terminus of the peptide or the 
vehicle, as described further below. Derivatives of the above compounds 
(described below) are also encompassed by this invention. 

The compounds of this invention may be prepared by standard 
10 synthetic methods, recombinant DNA techniques, or any other methods of 
preparing peptides and fusion proteins. Compounds of this invention that 
encompass non-peptide portions may be synthesized by standard organic 
chemistry reactions, in addition to standard peptide chemistry reactions 
when applicable. 

1 5 The primary use contemplated is as therapeutic or prophylactic 

agents. The vehicle-linked peptide may have activity comparable to — or 
even greater than — the natural ligand mimicked by the peptide. In 
addition, certain natural ligand-based therapeutic agents might induce 
antibodies against the patient's own endogenous ligand; the vehicle-linked 

2 0 peptide avoids this pitfall by having little or typically no sequence identity 
with the natural ligand. 

Although mostly contemplated as therapeutic agents, compounds 
of this invention may also be useful in screening for such agents. For 
example, one could use an Fc-peptide (e.g., Fc-SH2 domain peptide) in an 

2 5 assay employing anti-Fc coated plates. The vehicle, especially Fc, may 
make insoluble peptides soluble and thus useful in a number of assays. 

The compounds of this invention may be used for therapeutic or 
prophylactic purposes by formulating them with appropriate 
pharmaceutical carrier materials and administering an effective amount to 
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a patient, such as a human (or other mammal) in need thereof. Other 
related aspects are also included in the instant invention. 

Numerous additional aspects and advantages of the present 
invention will become apparent upon consideration of the figures and 
5 detailed description of the invention. 

Brief Description of the Figures 
Figure 1 shows a schematic representation of an exemplary process 
of the invention. In this preferred process, the vehicle is an Fc domain, 
which is linked to the peptide covalently by expression from a DNA 
1 0 construct encoding both the Fc domain and the peptide* As noted in 
Figure 1, the Fc domains spontaneously form a dimer in this process. 

Figure 2 shows exemplary Fc dimers that may be derived from an 
IgGl antibody. "Fc" in the figure represents any of the Fc variants within 
the meaning of "Fc domain" herein. "X 1 " and "X 2 " represent peptides or 
15 linker-peptide combinations as defined hereinafter. The specific dimers are 
as follows: 

A, D: Single disulfide-bonded dimers. IgGl antibodies typically 
have two disulfide bonds at the hinge region between the constant and 
variable domains. The Fc domain in Figures 2A and 2 D may be formed by 

2 0 truncation between the two disulfide bond sites or by substitution of a 
cysteinyl residue with an unreactive residue (e.g., alanyl). In Figure 2A, 
the Fc domain is linked at the amino terminus of the peptides; in 2D, at the 
carboxyl terminus. 

B, E: Doubly disulfide-bonded dimers. This Fc domain may be 
2 5 formed by truncation of the parent antibody to retain both cysteinyl 

residues in the Fc domain chains or by expression from a construct 
including a sequence encoding such an Fc domain. In Figure 2B, the Fc 
domain is linked at the amino terminus of the peptides; in 2E, at the 
carboxyl terminus. 
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C, F: Noncovalent dimers. This Fc domain may be formed by 
elimination of the cysteinyl residues by either truncation or substitution. 
One may desire to eliminate the cysteinyl residues to avoid impurities 
formed by reaction of the cysteinyl residue with cysteinyl residues of other 
5 proteins present in the host cell. The noncovalent bonding of the Fc 
domains is sufficient to hold together the dimer. 

Other dimers may be formed by using Fc domains derived from different 
types of antibodies (e.g., IgG2, IgM). 

Figure 3 shows the structure of preferred compounds of the 

10 invention that feature tandem repeats of the pharmacologically active 

peptide. Figure 3A shows a single chain molecule and may also represent 
the DNA construct for the molecule. Figure 3B shows a dimer in which the 
linker-peptide portion is present on only one chain of the dimer. Figure 3C 
shows a dimer having the peptide portion on both chains. The dimer of 

1 5 Figure 3C will form spontaneously in certain host cells upon expression of 
a DNA construct encoding the single chain shown in Figure 3A. In other 
host cells, the cells could be placed in conditions favoring formation of 
dimers or the dimers can be formed in vitro . 

Figure 4 shows exemplary nucleic acid and amino acid sequences 

2 0 (SEQ ID NOS: 1 and 2, respectively) of human IgGl Fc that may be used in 
this invention. 

Figure 5 shows a synthetic scheme for the preparation of PEGylated 
peptide 19 (SEQ ID NO: 3). 

Figure 6 shows a synthetic scheme for the preparation of PEGylated 
2 5 peptide 20 (SEQ ID NO: 4). 

Figure 7 shows the nucleotide and amino acid sequences (SEQ ID 
NOS: 5 and 6, respectively) of the molecule identified as "Fc-TMP" in 
Example 2 hereinafter. 
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Figure 8 shows the nucleotide and amino acid sequences (SEQ. ID. 
NOS: 7 and 8, respectively) of the molecule identified as "Fc-TMP-TMP" in 
Example 2 hereinafter. 

Figure 9 shows the nucleotide and amino acid sequences (SEQ. ID. 
5 NOS: 9 and 10, respectively) of the molecule identified as "TMP-TMP-Fc" 
in Example 2 hereinafter. 

Figure 10 shows the nucleotide and amino acid sequences (SEQ. ID. 
NOS: 11 and 12, respectively) of the molecule identified as "TMP-Fc" in 
Example 2 hereinafter. 
1 0 Figure 11 shows the number of platelets generated in vivo in 

normal female BDF1 mice treated with one 100 ju,g/kg bolus injection of 
various compounds, with the terms defined as follows. 

PEG-MGDF: 20 kD average molecular weight PEG attached by 
reductive amination to the N-terminal amino group of amino 
15 acids 1-163 of native human TPO, which is expressed in E. coli 

(so that it is not glycosylated); 
TMP: the TPO-mimetic peptide having the amino acid sequence 

IEGPTLRQWLAARA (SEQ ID NO: 13); 
TMP-TMP: the TPO-mimetic peptide having the amino acid 
2 0 sequence IEGPTLRQWLAARA-GGGGGGGG- 

IEGPTLRQWLAARA (SEQ ID NO: 14); 
PEG-TMP-TMP: the peptide of SEQ ID NO: 14, wherein the PEG 
group is a 5 kD average molecular weight PEG attached as 
shown in Figure 6; 

2 5 Fc-TMP-TMP: the compound of SEQ ID NO: 8 (Figure 8) dimerized 

with an identical second monomer (i.e., Cys residues 7 and 10 
are bound to the corresponding Cys residues in the second 
monomer to form a dimer, as shown in Figure 2); and 
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TMP-TMP-Fc is the compound of SEQ ID NO: 10 (Figure 9) 

dimerized in the same way as TMP-TMP-Fc except that the Fc 
domain is attached at the C-terminal end rather than the N- 
terminal end of the TMP-TMP peptide. 
5 Figure 12 shows the number of platelets generated in vivo in 

normal BDF1 mice treated with various compounds delivered via 
implanted osmotic pumps over a 7-day period. The compounds are as 
defined for Figure 7. 

Figure 13 shows the nucleotide and amino acid sequences (SEQ. ID. 
10 NOS: 15 and 16, respectively) of the molecule identified as "Fc-EMP" in 
Example 3 hereinafter, 

Figure 14 shows the nucleotide and amino acid sequences (SEQ ID 
NOS: 17 and 18, respectively) of the molecule identified as // EMP-Fc // in 
Example 3 hereinafter. 
15 Figure 15 shows the nucleotide and amino acid sequences (SEQ ID 

NOS:19 and 20, respectively) of the molecule identified as "EMP-EMP-Fc" 
in Example 3 hereinafter. 

Figure 16 shows the nucleotide and amino acid sequences (SEQ ID 
NOS: 21 and 22, respectively) of the molecule identified as "Fc-EMP-EMP" 
2 0 in Example 3 hereinafter. 

Figures 17A and 17B show the DNA sequence (SEQ ID NO: 23) 
inserted into pCFM1656 between the unique Aat ll (position #4364 in 
pCFM1656) and SacI I (position #4585 in pCFM1656) restriction sites to 
form expression plasmid pAMG21 (ATCC accession no. 98113). 
2 5 Figure 18A shows the hemoglobin, red blood cells, and hematocrit 

generated in vivo in normal female BDF1 mice treated with one 100 |^g/kg 
bolus injection of various compounds. Figure 18B shows the same results 
with mice treated with 100 jtcg/kg per day delivered by 7-day micro- 
osmotic pump with the EMPs delivered at 100 ^g/kg, rhEPO at 
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SOU /mouse. (In both experiments, neutrophils, lymphocytes, and platelets 
were unaffected.) In these figures, the terms are defined as follows. 

Fc-EMP: the compound of SEQ ID NO: 16 (Figure 13) dimerized 
with an identical second monomer (i.e., Cys residues 7 and 10 are 
5 bound to the corresponding Cys residues in the second monomer to 

form a dimer, as shown in Figure 2); 

EMP-Fc: the compound of SEQ ID NO: 18 (Figure 14) dimerized in 
the same way as Fc-EMP except that the Fc domain is attached at 
the C-terminal end rather than the N-terminal end of the EMP 

10 peptide. 

"EMP-EMP-Fc" refers to a tandem repeat of the same peptide (SEQ 
ID NO: 20) attached to the same Fc domain by the carboxyl 
terminus of the peptides. "Fc-EMP-EMP" refers to the same tandem 
repeat of the peptide but with the same Fc domain attached at the 

1 5 amino terminus of the tandem repeat. All molecules are expressed 

in E. coli and so are not glycosylated. 

Figures 19A and 19B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1055 and 1056) of the Fc-TNF-oc inhibitor fusion molecule 
described in Example 4 hereinafter^ 
2 0 Figures 20A and 20B show the nucleotide and amino acid sequences 

(SEQ ID NOS: 1057 and 1058) of the TNF-ce inhibitor-Fc fusion molecule 
described in Example 4 hereinafter. 

Figures 21A and 21B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1059 and 1060) of the Fc-IL-1 antagonist fusion molecule 
2 5 described in Example 5 hereinafter. 

Figures 22A and 22B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1061 and 1062) of the IL-1 antagonist-Fc fusion molecule 
described in Example 5 hereinafter. 
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Figures 23A, 23B, and 23C show the nucleotide and amino acid 
sequences (SEQ ID NOS: 1063 and 1064) of the Fc-VEGF antagonist fusion 
molecule described in Example 6 hereinafter. 

Figures 24A and 24B show the nucleotide and amino acid sequences 
5 (SEQ ID NOS: 1065 and 1066) of the VEGF antagonist-Fc fusion molecule 
described in Example 6 hereinafter. 

Figures 25A and 25B show the nucleotide and amino acid sequences 
(SEQ ID NOS: 1067 and 1068) of the Fc-MMP inhibitor fusion molecule 
described in Example 7 hereinafter. 
1 0 Figures 26A and 26B show the nucleotide and amino acid sequences 

(SEQ ID NOS: 1069 and 1070) of the MMP inhibitor-Fc fusion molecule 
described in Example 7 hereinafter. 

Detailed Description of the Invention 
Definition of Terms 
1 5 The terms used throughout this specification are defined as follows, 

unless otherwise limited in specific instances. 

The term "comprising" means that a compound may include 
additional amino acids on either or both of the N- or C- termini of the 
given sequence. Of course, these additional amino acids should not 
2 0 significantly interfere with the activity of the compound. 

The term "vehicle" refers to a molecule that prevents degradation 
and /or increases half-life, reduces toxicity, reduces immunogenicity, or 
increases biological activity of a therapeutic protein. Exemplary vehicles 
include an Fc domain (which is preferred) as well as a linear polymer (e.g., 
2 5 polyethylene glycol (PEG), polylysine, dextran, etc.); a branched-chain 
polymer (see, for example, U.S. Patent No. 4,289,872 to Denkenwalter et 
al., issued September 15, 1981; 5,229,490 to Tarn, issued July 20, 1993; WO 
93/21259 by Frechet etal., published 28 October 1993); a lipid; a 
cholesterol group (such as a steroid); a carbohydrate or oligosaccharide; or 
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any natural or synthetic protein, polypeptide or peptide that binds to a 
salvage receptor. Vehicles are further described hereinafter. 

The term "native Fc" refers to molecule or sequence comprising the 
sequence of a non-antigen-binding fragment resulting from digestion of 
5 whole antibody, whether in monomeric or multimeric form. The original 
immunoglobulin source of the native Fc is preferably of human origin and 
may be any of the immunoglobulins, although IgGl and IgG2 are 
preferred. Native Fc's are made up of monomeric polypeptides that may 
be linked into dimeric or multimeric forms by covalent (i.e., disulfide 

1 0 bonds) and non-covalent association. The number of intermolecular 
disulfide bonds between monomeric subunits of native Fc molecules 
ranges from 1 to 4 depending on class (e.g., IgG / IgA, IgE) or subclass (e.g., 
IgGl, IgG2, IgG3, IgAl, IgGA2). One example of a native Fc is a disulfide- 
bonded dimer resulting from papain digestion of an IgG (see Ellison etal. 

15 (1982), Nucleic Acids Res . 10: 4071-9). The term "native Fc" as used herein 
is generic to the monomeric, dimeric, and multimeric forms. 

The term "Fc variant" refers to a molecule or sequence that is 
modified from a native Fc but still comprises a binding site for the salvage 
receptor, FcRn. International applications WO 97/34631 (published 25 

2 0 September 1997) and WO 96/32478 describe exemplary Fc variants, as 

well as interaction with the salvage receptor, and are hereby incorporated 
by reference. Thus, the term "Fc variant" comprises a molecule or 
sequence that is humanized from a non-human native Fc. Furthermore, a 
native Fc comprises sites that may be removed because they provide 

2 5 structural features or biological activity that are not required for the fusion 
molecules of the present invention. Thus, the term "Fc variant" comprises 
a molecule or sequence that lacks one or more native Fc sites or residues 
that affect or are involved in (1) disulfide bond formation, (2) 
incompatibility with a selected host cell (3) N-terminal heterogeneity upon 
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expression in a selected host cell, (4) glycosylation, (5) interaction with 
complement, (6) binding to an Fc receptor other than a salvage receptor, or 
(7) antibody-dependent cellular cytotoxicity (ADCC). Fc variants are 
described in further detail hereinafter. 
5 The term "Fc domain" encompasses native Fc and Fc variant 

molecules and sequences as defined above. As with Fc variants and native 
Fc's, the term "Fc domain" includes molecules in monomeric or 
multimeric form, whether digested from whole antibody or produced by 
other means. 

10 The term "mul timer" as applied to Fc domains or molecules 

comprising Fc domains refers to molecules having two or more 
polypeptide chains associated covalently, noncovalently, or by both 
covalent and non-covalent interactions. IgG molecules typically form 
dimers; IgM, pentamers; IgD, dimers; and IgA, monomers, dimers, 

15 trimers, or tetramers. Multimers may be formed by exploiting the 

sequence and resulting activity of the native Ig source of the Fc or by 
derivatizing (as defined below) such a native Fc. 

The term "dimer" as applied to Fc domains or molecules 
comprising Fc domains refers to molecules having two polypeptide chains 

2 0 associated covalently or non-covalently. Thus, exemplary dimers within 
the scope of this invention are as shown in Figure 2. 

The terms "derivatizing" and "derivative" or "derivatized" 
comprise processes and resulting compounds respectively in which (1) the 
compound has a cyclic portion; for example, cross-linking between 

25 cysteinyl residues within the compound; (2) the compound is cross-linked 
or has a cross-linking site; for example, the compound has a cysteinyl 
residue and thus forms cross-linked dimers in culture or in vivo; (3) one or 
more peptidyl linkage is replaced by a non-peptidyl linkage; (4) the NT- 
terminus is replaced by -NRR 1 , NRC^R 1 , -NRQOJOR 1 , -NRSCO^R 1 , - 
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NHC(0)NHR, a succinimide group, or substituted or unsubstituted 
benzyloxycarbonyl-NH-, wherein R and R 1 and the ring substituents are 
as defined hereinafter; (5) the C-terminus is replaced by -C(0)R 2 or -NR 3 R 4 
wherein R 2 , R 3 and R 4 are as defined hereinafter; and (6) compounds in 
5 which individual amino acid moieties are modified through treatment 
with agents capable of reacting with selected side chains or terminal 
residues. Derivatives are further described hereinafter. 

The term "peptide" refers to molecules of 2 to 40 amino acids, with 
molecules of 3 to 20 amino acids preferred and those of 6 to 15 amino acids 

1 0 most preferred. Exemplary peptides may be randomly generated by any 
of the methods cited above, carried in a peptide library (e.g., a phage 
display library), or derived by digestion of proteins. 

The term "randomized" as used to refer to peptide sequences refers 
to fully random sequences (e.g., selected by phage display methods) and 

15 sequences in which one or more residues of a naturally occurring molecule 
is replaced by an amino acid residue not appearing in that position in the 
naturally occurring molecule. Exemplary methods for identifying peptide 
sequences include phage display, E. coli display, ribosome display, yeast- 
based screening, RNA-peptide screening, chemical screening, rational 

2 0 design, protein structural analysis, and the like. 

The term "pharmacologically active" means that a substance so 
described is determined to have activity that affects a medical parameter 
(e.g., blood pressure, blood cell count, cholesterol level) or disease state 
(e.g., cancer, autoimmune disorders). Thus, pharmacologically active 

2 5 peptides comprise agonistic or mimetic and antagonistic peptides as 
defined below. 

The terms "-mimetic peptide" and "-agonist peptide" refer to a 
peptide having biological activity comparable to a protein (e.g., EPO, TPO, 
G-CSF) that interacts with a protein of interest. These terms further 
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include peptides that indirectly mimic the activity of a protein of interest, 
such as by potentiating the effects of the natural ligand of the protein of 
interest; see, for example, the G-CSF-mimetic peptides listed in Tables 2 
and 7. Thus, the term "EPO-mimetic peptide" comprises any peptides that 
5 can be identified or derived as described in Wrighton et al. (1996), Science 
273 : 458-63, Naranda et al. (1999), Proc. Natl. Acad. Sci. USA 96: 7569-74, 
or any other reference in Table 2 identified as having EPO-mimetic subject 
matter. Those of ordinary skill in the art appreciate that each of these 
references enables one to select different peptides than actually disclosed 
1 0 therein by following the disclosed procedures with different peptide 
libraries. 

The term "TPO-ndmetic peptide" comprises peptides that can be 
identified or derived as described in Cwirla et al . (1997), Science 276: 1696- 
9 , U.S. Pat. Nos. 5,869,451 and 5,932,946 and any other reference in Table 2 

15 identifed as having TPO-mimetic subject matter, as well as the U.S. patent 
application, "Thrombopoietic Compounds," filed on even date herewith 
and hereby incorporated by reference. Those of ordinary skill in the art 
appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 

2 0 procedures with different peptide libraries. 

The term "G-CSF-mimetic peptide" comprises any peptides that 
can be identified or described in Paukovits et al . (1984), Hoppe-Seylers Z. 
Physiol. Chem . 365: 303-11 or any of the references in Table 2 identified as 
having G-CSF-mimetic subject matter. Those of ordinary skill in the art 

25 appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 
procedures with different peptide libraries. 

The term "CTLA4-mimetic peptide" comprises any peptides that 
can be identified or derived as described in Fukumoto et al . (1998), Nature 
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Biotech . 16: 267-70. Those of ordinary skill in the art appreciate that each of 
these references enables one to select different peptides than actually 
disclosed therein by following the disclosed procedures with different 
peptide libraries. 

5 The term "-antagonist peptide" or "inhibitor peptide" refers to a 

peptide that blocks or in some way interferes with the biological activity of 
the associated protein of interest, or has biological activity comparable to a 
known antagonist or inhibitor of the associated protein of interest. Thus, 
the term "TNF-antagonist peptide" comprises peptides that can be 

1 0 identified or derived as described in Takasaki et al . (1997), Nature Biotech . 
15: 1266-70 or any of the references in Table 2 identified as having TNF- 
antagonistic subject matter. Those of ordinary skill in the art appreciate 
that each of these references enables one to select different peptides than 
actually disclosed therein by following the disclosed procedures with 

1 5 different peptide libraries. 

The terms "IL-l antagonist" and "IL-lra-mimetic peptide" 
comprises peptides that inhibit or down-regulate activation of the IL-l 
receptor by IL-l. IL-l receptor activation results from formation of a 
complex among IL-l, IL-l receptor, and IL-l receptor accessory protein. 

2 0 IL-l antagonist or IL-lra-mimetic peptides bind to IL-l, IL-l receptor, or 
IL-l receptor accessory protein and obstruct complex formation among 
any two or three components of the complex. Exemplary IL-l antagonist 
or IL-lra-mimetic peptides can be identified or derived as described in 
U.S. Pat. Nos. 5,608,035, 5,786,331, 5,880,096, or any of the references in 

2 5 Table 2 identified as having IL-lra-mimetic or IL-l antagonistic subject 
matter. Those of ordinary skill in the art appreciate that each of these 
references enables one to select different peptides than actually disclosed 
therein by following the disclosed procedures with different peptide 
libraries. 



-23- 



WO 01/83525 



PCT/US01/14310 



The term "VEGF-antagonist peptide" comprises peptides that can 
be identified or derived as described in Fairbrother (1998), Biochem. 37: 
17754-64, and in any of the references in Table 2 identified as having 
VEGF-antagonistic subject matter. Those of ordinary skill in the art 
5 appreciate that each of these references enables one to select different 
peptides than actually disclosed therein by following the disclosed 
procedures with different peptide libraries. 

The term "MMP inhibitor peptide" comprises peptides that can be 
identified or derived as described in Koivunen (1999), Nature Biotech. 17: 

1 0 768-74 and in any of the references in Table 2 identified as having MMP 
inhibitory subject matter. Those of ordinary skill in the art appreciate that 
each of these references enables one to select different peptides than 
actually disclosed therein by following the disclosed procedures with 
different peptide libraries. 

15 Additionally, physiologically acceptable salts of the compounds of 

this invention are also encompassed herein. By "physiologically 
acceptable salts" is meant any salts that are known or later discovered to 
be pharmaceutically acceptable. Some specific examples are: acetate; 
trifluoroacetate; hydrohalides, such as hydrochloride and hydrobromide; 

2 0 sulfate; citrate; tartrate; glycolate; and oxalate. 
Structure of compounds 

In General . In the compositions of matter prepared in accordance 
with this invention, the peptide may be attached to the vehicle through the 
peptide's N-terminus or C-terminus. Thus, the vehicle-peptide molecules 
2 5 of this invention may be described by the following formula I: 
I 

(X 1 ) a -F 1 -(X 2 ) b 

wherein: 

F 1 is a vehicle (preferably an Fc domain); 
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X 3 and X 2 are each independently selected from -(L^-P 1 , -(V)-P 1 - 
(L 2 ) d -P 2 , -(L 1 ) c -P 1 -(L 2 ) d -P i! -(LVP 3 / and -(L VPML 2 ) d -P 2 -(L 3 ) e -P^-P 4 

P 1 , P 2 , P 3 , and P 4 are each independently sequences of 
pharmacologically active peptides; 
5 L 1 , L 2 , L 3 , and L 4 are each independently linkers; and 

a, b, c, d, e, and f are each independently 0 or 1, provided that at 
least one of a and b is 1. 

Thus, compound I comprises preferred compounds of the formulae 

II 

io X 1 -F 1 

and multimers thereof wherein F 1 is an Fc domain and is attached at the C- 

terminus of X 1 ; 

III 

F 1 -X 2 

1 5 and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 
terminus of X 2 ; 
IV 

F 1 -(L 1 ) C -P 1 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 
2 0 terminus of -(L^-P 1 ; and 
V 

F 1 -(L 1 ) c -P 1 -(L 2 ) d -P 2 

and multimers thereof wherein F 1 is an Fc domain and is attached at the N- 
terminus of -L'-P'-L'-P 2 . 
2 5 Peptides . Any number of peptides may be used in conjunction with 

the present invention. Of particular interest are peptides that mimic the 
activity of EPO, TPO, growth hormone, G-CSF, GM-CSF, IL-lra, leptin, 
CTLA4, TRAIL, TGF-a, and TGF-p\ Peptide antagonists are also of 
interest particularly those antagonistic to the activity of TNF, leptin, any 
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of the interleukins (IL-1, 2, 3, . . .), and proteins involved in complement 
activation (e.g., C3b). Targeting peptides are also of interest, including 
tumor-homing peptides, membrane-transporting peptides, and the like. 
All of these classes of peptides may be discovered by methods described in 
5 the references cited in this specification and other references. 

Phage display, in particular, is useful in generating peptides for use 
in the present invention. It has been stated that affinity selection from 
libraries of random peptides can be used to identify peptide ligands for 
any site of any gene product. Dedman etal. (1993), T- Biol. Chem . 268: 

10 23025-30. Phage display is particularly well suited for identifying peptides 
that bind to such proteins of interest as cell surface receptors or any 
proteins having linear epitopes. Wilson et al. (1998), Can. L Microbiol. 44: 
313-29; Kay et al . (1998), Drug Disc. Today 3: 370-8. Such proteins are 
extensively reviewed in Herz et al . (1997), T- Receptor & Signal 

15 Transduction Res . 17(5): 671-776, which is hereby incorporated by 

reference. Such proteins of interest are preferred for use in this invention. 

A particularly preferred group of peptides are those that bind to 
cytokine receptors. Cytokines have recently been classified according to 
their receptor code. See Inglot (1997), Archivum Immunologiae et 

2 0 Therapiae Experimentalis 45: 353-7, which is hereby incorporated by 

reference. Among these receptors, most preferred are the CKRs (family I in 
Table 3). The receptor classification appears in Table 3. 
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Table 3 — Cytokine Receptors Classified by Receptor Code 



Cytokines (ligands) 


Receptor Type 


family subfamily 


family subfamily 


I. Hematopoietic 1. IL-2, IL-4, IL-7, 
cytokines IL-9, IL-13, IL- 
15 

2. IL-3, IL-5, GM- 
CSF 

3. IL-6, IL-11, IL- 
12, LIF, OSM, 
CNTF, Leptin 
(OB) 

4. G-CSF, EPO, 
TPO, PRL, GH 

R TT 17 TLTA 7"C TT 

o. lL-1/, rivb-lL- 
17 


I. Cytokine R 1. shared yCr, IL- 
(CKR) 9R, IL-4R 

2. shared GP 140 
PR 

3. 3.shared RP 
130, IL-6 R, 
Leptin R 

4. "single chain" 
R, GCSF-R, 
TPO-R, GH-R 

5. other R c 


II. IL-10 ligands IL-10, BCRF-1, 
HSV-IL-10 


II. IL-10 R 


III. Interferons 1. IFN-al, a2, a4, 

m, t, IFN-(3 d 
2. IFN-Y 


III. Interferon R 1. IFNAR 
2. IFNGR 


IV. IL-landIL-1 1. IL-la, IL-lp, 
like ligands IL-IRa 

2. IL-18, IL-18BP 


IV. IL-1R 1. IL-1R,IL- 

lRAcP 
2. IL-18R, IL- 
18RAcP 


V. TNF family 1. TNF-a, TNF-(3 

(LT), FASL, 
CD40 L, 
CD30L, CD27 
L, OX40L, 
OPGL, TRAIL, 
APRIL, AGP-3, 
BLys, TL5, 
Ntn-2, KAY, 
Neutrokine-a 


3. NGF/TNF R e TNF-RI, AGP-3R, 
DR4, DR5, OX40, 
OPG, TACI, CD40, 
FAS, ODR 


VI. Chemokines 1. a chemokines: 

IL-8, GRO a, (3, 
Y, IF-10, PF-4, 
SDF-1 
2. P chemokines: 

lUTPIrv M7P1R 


4. ChemokineR 1. CXCR 
2. CCR 



1 IL-17R - belongs to CKR family but is unassigned to 4 indicated subjamilies. 

2 Other IFN type I subtypes remain unassigned. Hematopoietic cytokines, IL-10 ligands and 
interferons do not possess functional intrinsic protein kinases. The signaling molecules for the 
cytokines are JAK's, STATs and related non-receptor molecules. IL-14, IL-16 and IL-18 have been 
cloned but according to the receptor code they remain unassigned. 

3 TNF receptors use multiple, distinct intracellular molecules for signal transduction including 
"death domain" of FAS R and 55 kDa TNF-ocR that participates in their cytotoxic effects. NGF/TNF 
R can bind both NGF and related factors as well as TNF ligands. Chemokine receptors are seven 
transmembrane (7TM, serpentine) domain receptors. They are G protein-coupied. 
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MIPloc, MIPlp, 






MCP-1,2,3,4, 






RANTES, 






eotaxin 


3. 


CR 


3. y chemokines: 




DARC f 


lymphotactin 


4. 


VII. Growth factors 1.1 SCF, M-CSF, 


VII. RKF 1. 


TK sub-family 


PDGF-AA, AB, 


1.1 


IgTK III R, 


BB, KDR, FLT- 




VEGF-RI, 


1, FLT-3L, 




VEGF-RII 


VEGF, SSV- 






PDGF, HGF 7 SF 






1.2 FGFa, FGF(3 


1.2 


IgTK IV R 




1.3 


C vs tpin p-ti cY\ 


W-F19 (EGF- 




TK-I 


like) 






1.4 IGF-I, IGF-II, 


1.4 


Cysteine rich 


Insulin 




TK-II, IGF-RI 


1.5 NGF, BDNF, 


1.5 


Cysteine knot 


NT-3, NT-4 S 




TK V 


2. TGF-pi / |32 / p3 


2. 


Serine- 




threonine 
kinase 






subfamily 
(STKS) h 







Particular proteins of interest as targets for peptide generation in 
the present invention include the following: 
oovp3 

5 ocVpl 
Ang-2 
B7 

B7RP1 
CRP1 

10 Calcitonin 
CD28 
CETP 
cMet 

Complement factor B 
15 C4b 

CTLA4 



4 The Duffy blood group antigen (DARC) is an erythrocyte receptor that can bind several different 
chemokines. IL-1 R belongs to the immunoglobulin superfamily but their signal transduction events 
characteristics remain unclear. 

5 The neurotrophic cytokines can associate with NGF/TNF receptors also. 

6 STKS may encompass many other TGF-|3- related factors that remain unassigned. The protein 
kinases are intrinsic part of the intracellular domain of receptor kinase family (RKF). The enzymes 
participate in the signals transmission via the receptors. 
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Glucagon 
Glucagon Receptor 
LIPG 
MPL 

5 splice variants of molecules preferentially expressed on 

tumor cells; e.g., CD44, CD30 

unglycosylated variants of mucin and Lewis Y surface 
glycoproteins 
CD19, CD20, CD33, CD45 
10 prostate specific membrane antigen and prostate specific cell 

antigen 

matrix metalloproteinases (MMPs), both secreted and 
membrane-bound (e.g., MMP-9) 
Cathepsins 
1 5 angiopoietin-2 
TIE-2 receptor 
heparanase 

urokinase plasminogen activator (UP A), UPA receptor 
parathyroid hormone (PTH), parathyroid hormone-related 
2 0 protein (PTHrP), PTH-RI, PTH-RII 

Her2 
Her3 
Insulin — 



Exemplary peptides for this invention appear in Tables 4 through 
20 below. These peptides may be prepared by methods disclosed in the 
3 0 art. Single letter amino acid abbreviations are used. The X in these 

sequences (and throughout this specification, unless specified otherwise in 



1 IL-1 7R belongs to the CKR family but is not assigned to any of the 4 indicated subjamiiies. 
3 Other IFN type I subtypes remain unassigned. Hematopoietic cytokines, IL-10 ligands and 
interferons do not possess functional intrinsic protein kinases. The signaling molecules for the 
cytokines are JAK's, STATs and related non-receptor molecules. IL-1 4, IL-1 6 and IL-1 8 have been 
cloned but according to the receptor code they remain unassigned. 

k TNF receptors use multiple, distinct intracellular molecules for signal transduction including 
"death domain" of FAS R and 55 kDa TNF-aR that participates in their cytotoxic effects. NGF/TNF 
R can bind both NGF and related factors as well as TNF ligands. Chemokine receptors are G 
protein-coupled, seven transmembrane (7TM, serpentine) domain receptors. 
1 The Duffy blood group antigen (DARC) is an erythrocyte receptor that can bind several different 
chemokines. It belongs to the immunoglobulin superfamily but characteristics of its signal 
transduction events remain unclear. 

m The neurotrophic cytokines can associate with NGF/TNF receptors also. 
n STKS may encompass many other TGF-p-related factors that remain unassigned. The protein 
kinases are intrinsic part of the intracellular domain of receptor kinase family (RKF). The enzymes 
participate in the signals transmission via the receptors. 
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a particular instance) means that any of the 20 naturally occurring amino 
acid residues may be present. Any of these peptides may be linked in 
tandem (i.e., sequentially), with or without linkers, and a few tandem- 
linked examples are provided in the table. Linkers are listed as " r A" and 
5 may be any of the linkers described herein. Tandem repeats and linkers 
are shown separated by dashes for clarity. Any peptide containing a 
cysteinyl residue may be cross-linked with another Cys-containing 
peptide, either or both of which may be linked to a vehicle. A few cross- 
linked examples are provided in the table. Any peptide having more than 

1 0 one Cys residue may form an intrapeptide disulfide bond, as well; see, for 
example, EPO-mimetic peptides in Table 5. A few examples of 
intrapeptide disulfide-bonded peptides are specified in the table. Any of 
these peptides may be derivatized as described herein, and a few 
derivatized examples are provided in the table. Derivatized peptides in 

15 the tables are exemplary rather than limiting, as the associated 

underivatized peptides may be employed in this invention, as well. For 
derivatives in which the carboxyl terminus may be capped with an amino 
group, the capping amino group is shown as -NH 2 . For derivatives in 
which amino acid residues are substituted by moieties other than amino 

2 0 acid residues, the substitutions are denoted by a, which signifies any of 
the moieties described in Bhatnagar et al . (1996), T. Med. Chem . 39: 3814-9 
and Cuthbertson et al . (1997), T. Med. Chem . 40: 2876-82, which are 
incorporated by reference. The J substituent and the Z substituents (Z 5 , Z 6 , 
...Z 40 ) are as defined in U.S. Pat. Nos. 5,608,035 ,5,786,331, and 5,880,096, 

2 5 which are incorporated by reference. For the EPO-mimetic sequences 

(Table 5), the substituents X 2 through X n and the integer "n" are as defined 
in WO 96/40772, which is incorporated by reference. Also for the EPO- 
mimetic sequences, the substituents X na , X la , X 2a , X 3a , X 4a , X 5a and X ca follow 
the definitions of X n , X ir X 2 , X 3 , X 4 , X 5 , and X c , respectively, of WO 99/47151, 
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which is also incorporated by reference. The substituents "W," "6," and 
"+" are as defined in Sparks etal. (1996), Proc. Natl. Acad. Sci 93: 1540-4, 
which is hereby incorporated by reference. X 4 , X 5/ X 6 , and X 7 are as defined 
in U.S. Pat. No. 5,773,569, which is hereby incorporated by reference, 
5 except that: for integrin-binding peptides, X lf X 2 , X 3 , X 4 , X 5 , X 6 , X 7 , and X 8 
are as defined in International applications WO 95/14714, published June 
1, 1995 and WO 97/08203, published March 6, 1997, which are also 
incorporated by reference; and for VIP-mimetic peptides, X x , X/, X/', X 2 , X 3 , 
X 4 , X 5 , X 6 and Z and the integers m and n are as defined in WO 97/40070, 
10 published October 30, 1997, which is also incorporated by reference. Xaa 
and Yaa below are as defined in WO 98/09985, published March 12, 1998, 
which is incorporated by reference. AA a , AA 2 , AB 17 AB 2 , and AC are as 
defined in International application WO 98/53842, published December 3, 

1998, which is incorporated by reference. X 1 , X 2 , X 3 , and X 4 in Table 17 only 
15 are as defined in European application EP 0 911 393, published April 28, 

1999. Residues appearing in boldface are D-amino acids. All peptides are 
linked through peptide bonds unless otherwise noted. Abbreviations are 
listed at the end of this specification. In the "SEQ ID NO." column, "NR" 
means that no sequence listing is required for the given sequence. 

20 



Table 4 — IL-1 antagonist peptide sequences 



Sequence/structure 


SEQ 




ID NO: 


Z 11 Z 7 Z 8 QZ 5 YZ R Z B Z, n 


212 


XXQZ 5 YZ e XX 


907 


Z 7 XQZ,YZ„XX 


908 


Z^QZ.YZ^Z,, 


909 


Z^Z^QZ^Z^ 


910 


Zi 2^13^1 4^1 5^1 e^i 7^1 aZ 19 Z 20 Z 21 Z 22 Z i 1 Z 7 Z B QZ 5 YZ B Z 9 Z i0 L 


917 


Z 2 gNZ 24 Z g9 Z 25 Z 26 Z ?7 Zp R Z 29 Z 30 Z 40 


979 


TANVSSFEWTPYYWQPYALPL 


213 


SWTDYGYWQPYALPISGL 


214 


ETPFTWEESNAYYWQPYALPL 


215 
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ENTYSPNWADSMYWQPYALPL 


216 


SVGEDHN FWTS EYWQ P YALP L 


217 


DGYDRWRQSGERYWQPYALPL 


218 


FEWTPGYWQPY 


219 


FEWTPGYWQHY 


220 


FEWTPGWYQJY 


221 


AcFEWTPGWYQJY 


222 


FEWTPGWpYQJY 


223 


FAWTPGYWQJY 


224 


FEWAPGYWQJY 


225 


FEWVPGYWQJY 


226 


FEWTPGYWQJY 


227 


AcFEWTPGYWQJY 


228 


FEWTPaWYQJY 


229 


FEWTPSarWYQJY 


230 


FEWTPGYYQPY 


231 


FEWTPGWWQPY 


232 


FEWTPNYWQPY 


233 


FEWTPvYWQJY 


234 


FEWTPecGYWQJY 


235 


FEWTPAibYWQJY 


236 


FEWTSarGYWQJY 


237 


FEWTPGYWQPY 


238 


FEWTPGYWQHY 


239 


FEWTPGWYQJY 


240 


AcFEWTPGWYQJY 


241 


FEWTPGW-pY-QJY 


242 


FAWTPGYWQJY 


243 


FEWAPGYWQJY 


244 


FEWVPGYWQJY 


245 


FEWTPGYWQJY 


246 


AcFEWTPGYWQJY 


247 


FEWTPAWYQJY 


248 


FEWTPSarWYQJY 


249 


FEWTPGYYQPY 


250 


FEWTPGWWQPY 


251 


FEWTPNYWQPY 


252 


FEWTPVYWQJY 


253 


FEWTPecGYWQJY 


254 


FEWTPAibYWQJY 


255 


FEWTSarGYWQJY 


256 


FEWTPGYWQPYALPL 


257 


1 NapEWTPGYYQJY 


258 


YEWTPGYYQJY 


259 


FEWVPGYYQJY 


260 


FEWTPSYYQJY 


261 


FEWTPNYYQJY 


262 


TKPR 


263 


RKSSK 


264 


RKQDK 


265 
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NRKQDK 


266 


1""* 1 S i~\ T~\ 1 f\ 

RKQDKR 


267 


ENRKQDKRF 


268 


VTKFYF 


269 


VTKFY 


270 


VTDFY 


271 


SHLYWQPYSVQ 


671 


TL V Y WQ P YS LQT 


672 


RG DYWQ P YS VQS 


673 


VH V Y WQ P YS VQT 


674 


RLVYWQ P YS VQT 


675 


SRVWFQPYSLQS 


676 


N M V Y WQ P YS 1 QT 


677 


SVVFWQPYSVQT 


678 


TFVYWQPYALPL 


679 


TL V Y WQ P YS 1 Q R 


680 


R L V Y WQ P YS V Q R 


681 


SP VF WQ P YS IQ 1 


682 


Wl E W WQ PYS VQS 


683 


SLIYWQPYSLQM 


684 


TR L Y WQ P YS VQ R 


685 


RC D Y WQ PYS VQT 


686 


M R VF WQ P YS VQ N 


687 


KIVYWQPYSVQT 


688 


R H L Y WQ P YS VQ R 


689 


ALV W WQ PYS EQ 1 


690 


SRVWFQPYSLQS 


691 


WEQPYALPLE 


692 


Q L V W WQ P YS VQ R 


693 


D LR Y WQ P YS VQ V 


694 


ELVWWQPYSLQL 


695 


DLVWWQPYSVQW 


696 


NGN Y WQ PYS FQ V 


697 


ELV Y WQ PYS IQ R 


698 


ELM Y WQ P YS VQ E 


699 


N LLY WQ PYS M Q D 


700 


GYEWYQPYSVQR 


701 


S R V W YQ P YS VQ R 


702 


LS EQ YQ P YS VQ R 


703 


GGGWWQPYSVQR 


704 


VG R W YQ P YS VQ R 


705 


VnvYWUrYoVUH 


706 


QARWYQPYSVQR 


707 


VHVYWQPYSVQT 


708 


RSVYWQPYSVQR 


709 


TRVWFQPYSVQR 


710 


GRIWFQPYSVQR 


711 


GRVWFQPYSVQR 


712 


ARTWYQPYSVQR 


713 


ARVWWQPYSVQM 


714 
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RLMFYQPYSVQR 


715 


ES M W YQ P YS VQ R 


716 


H FGWWQPYSVHM 


717 


ARFWWQPYSVQR 


718 


RLVYWQ PYAP1Y 


719 


RLVYWQ PYSYQT 


720 


RLVYWQ PYSLPI 


721 


RLVYWQ PYSVQA 


722 


SRVWYQ PYAKGL 


723 


SRVWYQ PYAQGL 


724 


SRVWYQ PYAMPL 


725 


SRVWYQ PYSVQA 


726 


SRVWYQ PYSLGL 


727 


SRVWYQ PYAREL 


728 


SRVWYQ PYSRQP 


729 


SRVWYQ PYFVQP 


730 


EYEWYQ PYALPL 


731 


IPEYWQ PYALPL 


732 


SR1WWQ PYALPL 


733 


DPLFWQ PYALPL 


734 


SRQWVQ PYALPL 


735 


IRSWWQ PYALPL 


736 


RGYWQ PYALPL 


737 


RLLWVQ PYALPL 


738 


EYRWFQ PYALPL 


739 


DAYWVQ PYALPL 


740 


WSGYFQ PYALPL 


741 


NIEFWQ PYALPL 


742 


TRDWVQ PYALPL 


743 


DSSWYQ PYALPL 


744 


IGNWYQ PYALPL 


745 


NLRWDQ PYALPL 


746 


LPEFWQ PYALPL 


747 


DSYWWQ PYALPL 


748 


RSQYYQ PYALPL 


749 


ARFWLQ PYALPL 


750 


NSYFWQ PYALPL 


751 


R FM YWQ P YS VQ R 


752 


AH LF WQ P YS VQ R 


753 


WWQ PYALPL 


754 


YYQPYALPL 


755 


YFQPYALGL 


756 


YWYQ PYALPL 


757 


RWWQPYATPL 


758 


GWYQPYALGF 


759 


YWYQPYALGL 


760 


IWYQPYAMPL 


761 


SNMQPYQRLS 


762 


TFVYWQPY AVG LPAAETACN 


763 


TFVYWQPY SVQMTITGKVTM 


764 



-34- 



WO 01/83525 



PCT/US01/14310 



TFVYWQPY SSHXXVPXGFPL 


765 


TFVYWQPY YGNPQWAIHVRH 


766 


TFVYWQPY VLLELPEGAVRA 


767 


TFVYWQPY VDYVWPIPIAQV 


768 


GWYQPYVDGWR 


769 


RWEQPYVKDGWS 


770 


T~% A FV X N X A 1 X~\ l H f A F'X 

EWYQPYALGWAR 


771 


X*X 1 ■ ii i | A r-«X X A x^X 1 

GWWQPYARGL 


772 


LFEQPYAKALGL 


773 


GWEQPYARGLAG 


774 


AWVQPYATPLDE 


775 


M W YQ P YSS Q P A E 


776 


GWTQ PYSQQG E V 


777 


DWFQPYSIQSDE 


778 


PWIQPYARGFG 


779 


RPLYWQPYSVQV 


780 


— i— ' ■ |\ /l i I x*"*V X /™V \ * /*""X • 

TLIYWQPYSVQI 


781 


r-x r~^\ xt a # x**x i~xx X /*X i~x x*^*i" 

RFDYWQPYSDQT 


782 


WHQFVQPYALPL 


783 


EWDS VYWQPYSVQ TLLR 


784 


% a # r~~ x™x a i \ #\ xi a ix*"x i — \\ //xi / x-x /™\ r™~ a 

WEQN VYWQPYSVQ SFAD 


785 


SDV VYWQPYSVQ SLEM 


786 


\ XX x i~h x**x \ #x xi ji i/% tn\ x /*x % * x*x i r ■ ji a 

YYDG VYWQPYSVQ VMPA 


787 


SDIWYQ PYALPL 


788 


QRIWWQ PYALPL 


789 


SRIWWQ PYALPL 


790 


RSLYWQ PYALPL 


791 


TIIWEQ PYALPL 


792 


WETWYQ PYALPL 


793 


SYDWEQ PYALPL 


794 


SRIWCQ PYALPL 


795 


EIMFWQ PYALPL 


796 


DYVWQQ PYALPL 


797 


MDLLVQ WYQPYALPL 


798 


X™\ fX I/I /|| 1 A jl\ /rt i-XX X A 1 I 

GSKVIL WYQPYALPL 


799 


T"X X"X X~*V A A 1 1 \ A fX Xx"X\ n\ x a 1 f\ 1 

RQGANi WYQPYALPL 


800 


x"x x**x x"~\ r™x r™i i a fx x x^x i — %\ x a i n 1 

GGGDEP WYQPYALPL 


801 


X*X /™\ 1 I" - P^™¥~ I A IX //> t— |\ X A 1 r*x 1 

SQLERT WYQPYALPL 


802 


ETWVRE WYQPYALPL 


803 


KKGSTQ WYQPYALPL 


804 


1 /"X A I™* R A A 1 I A FX //X l"XX x A 1 l"X 1 

LQARMN WYQPYALPL 


805 


trnbUK WYQPYALPL 


806 


VKQKWR WYQPYALPL 


807 


LRRHDV WYQPYALPL 


808 


RSTASI WYQPYALPL 


809 


ESKEDQ WYQPYALPL 


810 


EGLTMK WYQPYALPL 


811 


EGSREG WYQPYALPL 


812 


VIEWWQ PYALPL 


813 


VWYWEQ PYALPL 


814 
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ASEWWQ PYALPL 


815 


p"\/r"\A/iAi/™\ n\/Ai ni 

FYEWWQ PYALPL 


816 


EGWWVQ PYALPL 


817 


WGEWLQ PYALPL 


818 


DYVWEQ PYALPL 


819 


A 1 1 Tl A Fl A f i*^v I — "\X / A I l - *! 1 

AHTWWQ PYALPL 


820 


F1EWFQ PYALPL 


821 


WLAWEQ PYALPL 


822 


VMEWWQ PYALPL 


823 


ERMWQ PYALPL 


824 


NXXWXX PYALPL 


825 


WGNWYQ PYALPL 


826 


TLYWEQ PYALPL 


827 


VWRWEQ PYALPL 


828 


LLWTQ PYALPL 


829 


SRIWXX PYALPL 


830 


SDIWYQ PYALPL 


831 


WGYYXX PYALPL 


832 


TSGWYQ PYALPL 


833 


\ /I I^^X^X^X^ r~\x, r A 1 I 

VHPYXX PYALPL 


834 


EHSYFQ PYALPL 


835 


XX1WYQ PYALPL 


836 


A j<~x til /™\ t™\x /At r~v i 

AQLHSQ PYALPL 


837 


WANWFQ PYALPL 


838 


SRLYSQ PYALPL 


839 


GVTFSQ PYALPL 


840 


SIVWSQ PYALPL 


841 


SRDLVQ PYALPL 


842 


HWGH VYWQPYSVQ DDLG 


843 


SWHS VYWQPYSVQ SVPE 


844 


WRDS VYWQPYSVQ PESA 


845 


TWDA VYWQPYSVQ KWLD 


846 


TPPW VYWQPYSVQ SLDP 


847 


YWSS VYWQPYSVQ SVHS 


848 


X ^1 A f X y ^^X / A ■ ✓"'v 1 

YWY QPY ALGL 


849 


\n«l\/ /—v i — \x x At i— v ■ 

YWY QPY ALPL 


850 


EWI QPY ATGL 


851 


NWE QPY AKPL 


•852 


AFY QPY ALPL 


853 


FLY QPY ALPL 


854 


VCK QPY LEWC 


855 


i rni — v\ ah — pom a\/\/u i r~\ r-w / a i r~ji 

b 1 PF 1 Whhb NAY YWQ PYALPL 


856 


QGWLTWQDSVDMYWQPYALPL 


857 


FSEAGYTWPENTYWQPYALPL 


858 


TESPGGLDWAKIYWQPYALPL 


859 


DGYDRWRQSGERYWQPYALPL 


860 


TANVSSFEWTPGYWQPYALPL 


861 * 


SVGEDHNFWTSE YWQ PYALPL 


862 


MNDQTSEVSTFP YWQPYALPL 


863 


SWSEAFEQPRNL YWQPYALPL 


864 
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s~\\s a i — no a i k i r~ \\ a \/\A/An\/Ai m 

QYAEPSALNDWG YWQPYALPL 


865 


NGDWATADWSNY YWQPYALPL 


866 


THDEHI YWQPYALPL 


867 


MLEKTYTTWTPG YWQPYALPL 


868 


WSDPLTRDADL YWQPYALPL 


869 


SDAF 1 1 QDSQAM YWQPYALPL 


870 


GDDAAWRTDSLT YWQPYALPL 


871 


AIIRQLYRWSEM YWQPYALPL 


872 


ENTYSPNWADSM YWQPYALPL 


873 


MNDQTSEVSTFP YWQPYALPL 


874 


SVGEDHNFWTSE YWQPYALPL 


875 


QTPFTWEESNAY YWQPYALPL 


876 


ENPFTWQESNAY YWQPYALPL 


877 


VTPFTWEDSNVF YWQPYALPL 


878 


QIPFTWEQSNAY YWQPYALPL 


879 


QAPLTWQESAAY YWQPYALPL 


880 


EPTFTWEESKAT YWQPYALPL 


881 


1 1 1 LTWEESNAY YWQPYALPL 


882 


ESPLTWEESSAL YWQPYALPL 


883 


ETPLTWEESNAY YWQPYALPL 


884 


FATFTWAF^MAY YWOPYAI PI 


OOD 


FAI FTWK'FQTAY YWDPVAI PI 


OOO 


^TP-TWFFCiMAY Y\A/nPVAI PI 
O 1 S 1 VVCCOINn T T VVVoir T nLrL 


00/ 


FTPFTWPFC1MAY Y\A/OPYAl PI 

L_ 1 r r 1 VV L_L_.OINAA I I WV^r i nLrL 


OOO 


k'APFTWFF^HAY VWHPYAI PI 
l\nr r 1 VV CI_OUn T I VVUr T nLrL 




QTCpT\A/pCQMAV YWOPYAI PI 

a 1 Or 1 VvttolMAY I vvUr TMLrL 




DQTFTWFFQMAY YWOPYAI PI 
L/o 1 r 1 VVttolMMY Y VVUr TALrL 




VIPFTIA/FFQMAV VIA^HPVAI PI 
Y In r I VVctoIMM Y Y VVUr YALrL 




nTAFT\A/FFQMAY YWOPVAI PI 
U 1 Ar I VVttOlNAT Y VVLJr Y MLr l_ 




ETLFTWEESNAT YWQPYALPL 


894 


VSSFTWEESNAY YWQPYALPL 


895 


QPYALPL 


896 


Py-1 -NapPYQJYALPL 


897 


TANVSSFEWTPG YWQPYALPL 


898 


FEWTPGYWQPYALPL 


899 


FEWTPGYWQJYALPL 


900 


FEWTPGYYQJYALPL 


901 


ETPFTWEESNAYYWQPYALPL 


902 


FTWEESNAYYWQJYALPL 


903 


ADVL YWQPYA PVTLWV 


904 


bUVAt YWUrYA LrLloL 


905 


SWTDYG YWQPYA LPISGL 


906 


FEWTPGYWQPYALPL 


911 


FEWTPGYWQJYALPL 


912 


FEWTPGWYQPYALPL 


913 


FEWTPGWYQJYALPL 


914 


FEWTPGYYQPYALPL 


915 


FEWTPGYYQJYALPL 


916 


TANVSSFEWTPGYWQPYALPL 


918 
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C\A/Tr\VPVlA/nDVAI nicr 1 ! 

oW 1 U YbYWUr YALrloGL 




h 1 rr 1 WbboNAY YWUrYALrL 


nor* 


cMT\/or»M\ft / a nOR i\/\A/r\nv a i ni 

ENTYbPNWADoMYWUPYALPL 


921 


o\//^i~t - m in i r~\ a /to r~\/\ a !t~\ n\/ a i ni 

SVGEDHNFWTobYWUPYALPL 


922 


DGYDRWRQSGERYWQPYALPL 


923 


FEWTPGYWQPYALPL 


924 


FEWTPGYWQPY 


925 


FEWTPGYWQJY 


926 


EWTPGYWQPY 


927 


FEWTPGWYQJY 


928 


AEWTPGYWQJY 


929 


FAWTPGYWQJY 


930 


FEATPGYWQJY 


931 


FEWAPGYWQJY 


932 


FEWTAGYWQJY 


933 


r— i a #-¥- 1— » A\/|> FX*V | \ y 

FEWTPAYWQJY 


934 


FEWTPGAWQJY 


935 


FEWTPGYAQJY 


936 


FEWTPGYWQJA 


937 


F E WTG G Y WQ J Y 


938 


F EWTPGYWQJY 


939 


1 — 1 — \ A IT" l/~^\/IA//™\ 1\/ 

F E WT J G YW Q J Y 


940 


FEWTPecGYWQJY 


941 


1 — I — \ a l~T~ i — 1 A :u.\/\ A t /~\ 1 \ / 

FEWTPAibYWQJY 


942 


FEWTPSarWYQJY 


943 


I - I - * A *~r~ /"k\ /* A »yv J \ X 

FEWTSarGYWQJY 


944 


(— 1 — \ A IT" 1 — > M\/l A //""" \ IX/ 

FEWTPNYWQJY 


945 


FEWTPVYWQJY 


946 


FE WT V P Y W Q J Y 


947 


A „l~ r"\A JTHA \ Al\/A 1 \ / 

AcFEWTPGWYQJY 


948 


AcFEWTPGYWQJY 


949 


INap-EWTPGYYQJY 


950 


YEWTPGYYQJY 


951 


FEWVPGYYQJY 


952 


FE WTPG YYQ J Y 


953 


r"r~\ a /Tn^\/\/A i\ / 

F E WT PsY YQ J Y 


954 


FEWTPnYYQJY 


955 


bHLY-Nap-QPYSVQM 


956 


T L V Y - N ap -Q P YS LQT 


957 


RG D Y-Nap-QPYSVQS 


958 


NMVY-Nap-QPYSIQT 


959 


\/VWO PVQ\/H 
V Y VVvJrYoVU 


you 


VY-Nap-QPYSVQ 


961 


TFVYWQJYALPL 


962 


FEWTPGYYQJ-Bpa 


963 


XaaFEWTPGYYQJ-Bpa 


964 


FEWTPGY-Bpa-QJY 


965 


AcFEWTPGY-Bpa-QJY 


966 


FEWTPG-Bpa-YQJY 


967 


AcFEWTPG-Bpa-YQJY 


968 
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AcFE-Bpa-TPGYYQJY 


969 


AcFE-Bpa-TPGYYQJY 


970 


Bpa-EWTPGYYQJY 


971 


AcBpa-EWTPGYYQJY 


972 


VYWQPYSVQ 


973 


R LVYWQ P YS VQ R 


974 


RLVY-Nap-QPYSVQR 


975 


RLDYWQPYSVQR 


976 


RLVWFQPYSVQR 


977 


RLVYWQPYSIQR 


978 


DNSSWYDSFLL 


980 


DNTAWYESFLA 


981 


DNTAWYENFLL 


982 


PARE DNTAWYDSFLI WC 


983 


TSEY DNTTWYEKFLA SQ 


984 


SQIP DNTAWYQSFLL HG 


985 


SPFI DNTAWYENFLL TY 


986 


EQ1Y DNTAWYDHFLL SY 


987 


TPFI DNTAWYENFLL TY 


988 


TYTY DNTAWYERFLM SY 


989 


TMTQ DNTAWYENFLL SY 


990 


Tl DNTAWYANLVQ TYPQ 


991 


Tl DNTAWYERFLA QYPD 


992 


HI DNTAWYENFLL TYTP 


993 


SQ DNTAWYENFLL SYKA 


994 


Ql DNTAWYERFLL QYNA 


995 


NQ DNTAWYESFLL QYNT 


996 


Tl DNTAWYENFLL NHNL 


997 


HY DNTAWYERFLQ QGWH 


998 


ETPFTWEESNAYYWQPYALPL 


999 


Yl PFTWEESN AYYWQPYALPL 


1000 


DG YDRWRQSG ERYWQPYALPL 


1001 


pY-INap-pY-QJYALPL 


1002 


TANVSSFEWTPGYWQPYALPL 


1003 


FEWTPGYWQJYALPL 


1004 


FEWTPGYWQPYALPLSD 


1005 


FEWTPGYYQJYALPL 


1006 


FEWTPGYWQJY 


1007 


Ac F EWTPGYWQJY 


1008 


AcFEWTPGWYQJY 


1009 


Ac F EWTPGYYQJY 


1010 


AcFEWTPaYWQJY 


1011 


AcFEWTPaWYQJY 


1012 


AcFEWTPaYYQJY 


1013 


FEWTPGYYQJYALPL 


1014 


FEWTPGYWQJYALPL 


1015 


FEWTPGWYQJYALPL 


1016 


TANVSSFEWTPGYWQPYALPL 


1017 


AcFEWTPGYWQJY 


1018 


AcFEWTPGWYQJY 


1019 
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AcFEWTPGYYQJY 


1020 


AcFEWTPAYWQJY 


1021 


AcFEWTPAWYQJY 


1022 


AcFEWTPAYYQJY 


1023 
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Table 5 — EPO-mimetic peptide sequences 



Sequence/structure 


SEQ 
ID NO: 


YXCXXGPXTWXCXP 


83 


VVOYYODYHA/Y^YD VVrYVf DVHA/YPVD 

YAOAALarA 1 VV AU Ar- Y AO AAu r A I WAUAr 




YXCXXGPXTWXCXP-A-YXCXXGPXTWXCXP 


85 


YXCXXGPXTWXCXP-A- , . , 

V (e-amine) 

\ 


86 


K 

/ 

YXCXXGPXTWXCXP-A- (a-amine) 


86 


G GTYSCH FG PLTWVCKPQGG 


87 


GGDYHCRMGPLTWVCKPLGG 


(JU 


GGVYACRMGPITWVCSPLGG 


89 


VGNYMCHFGPITWVCRPGGG 


90 


GGLYLCRFGPVTWDCGYKGG 


91 


GGTYSCHFGPLTWVCKPQGG- 
GGTYSCH FG PLTWVCKPQGG 


92 


G GTYSCH FG PLTWVCKPQGG -A- 
GGTYSCH FG PLTWVCKPQGG 


93 


GGTYSCH FG PLTWVCKPQGGSSK 


94 


r~v PTVCr^U rpni TlftA/Pl/DAP^OOl/ 

Ubi 1 YoL-Hr-taKL 1 WVUKrUiaubbK- 

(^nTY^P.I-lF^PI T\AA/nkTPnf^nQQk' 
1 I Ovn ~\Ji r L. I Vv V wr\r O Or\ 


95 


I I OunrurLI vv vuixr WuuOoia iv 

G GT YS C H FG PLTWVCKPQG G SS K 




GGTYSCHFGPLTWVCKPQGGSS 

\ (e-amine) 


97 


\ 

/ 
R A 

GGTYSCHFGPLTWVCKPQGGSS (ot-amine) 


97 


GGTYSCH FG PLTWVCKPQGGSSK(-A-biotin) 


98 


CX„X R GPXJWX 7 C 


421 


GGTYSCHGPLTWVCKPQGG 


422 


VGNYMAHMGPITWVCRPGG 


423 


GGPHHVYACRMGPLTWIC 


424 


G GTYSCH FGPLTWVCKPQ 


425 


GGLYACHMGPMTWVCQPLRG 


426 


TIAQYICYMGPETWECRPSPKA 


427 


YSCHFGPLTWVCK 


428 


YCHFGPLTWVC 


429 


X s X,X 5 GPXJWX 7 X n 


124 


YX^XAGPXJWX^, 


461 
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419 


X 1 YX ? CX 4 X R GPX R TWX 7 GX 9 X i0 X 11 


420 


GGLYLCRFGPVTWDGGYKGG 


1024 


GGTYSCHFGPLTWVCKPQGG 


1025 


GGDYHCRMGPLTWVCKPLGG 


1026 


VGNYMCHFGPITWVCRPGGG 


1029 


GGVYAGRMGPITWVCSPLGG 


1030 


VCaNYIvlAnlvlCaPn W VURrGCa 


1035 


G GTYSCH FG PLTWVCKPQ 


1036 


GGLYACHMGPMTWVCQPLRG 


1037 


TIAQYICYMGPETWECRPSPKA 


1038 


YSCHFGPLTWVCK 


1039 


YCHFGPLTWVC 


1040 


SCHFGPLTWVCK 


1041 


(AX 2 ) n X a X<X 5 GPXJWX 7 X R 


1042 


X n CX,X ; ,GWVGX 3 CX J X,WX R 


1110 
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Table 6 — TPO~mimetic peptide sequences 



Sequence/structure 


SEQ 




ID NO: 


IEGPTLRQWLAARA 


13 


iconTi qnU/l A Al/A 
1 1 1 Lr1wVVLM/Ar\A 


lf± 


IEGPTLREWLAARA 


25 


IEGPTLRQWLAARA-A-IEGPTLRQWLAARA 


26 


1 EG PTLRQWLAAKA-A-IEG PTLRQWLAAKA 


27 


IEGPTLRQCLAARA-A-IEGPTLRQCLAARA 

1 1 


28 


ipf^PTI ROW! AARA-A-kYRrAf^-A-IFf^PTI ROWI AARA 


9Q 


IFf^PTI ROWI AARA-A-K7PFR , UA-IFnPTI ROWI AARA 




ip/^p-ri ROHI AARA-A-IFHPT! ROWI AARA 


Jl 


1 

IEGPTLRQCLAARA-A-IEGPTLRQWLAARA 


31 


IEGPTLRQWLAARA-A-IEGPTLRQCLAARA 


32 


| 

1 EG PTLRQ WLAARA-A-IEG PTLRQCLAAR A 


32 


VRDQIXXXL 


33 


TLREWL 


34 


GRVRDQVAGW 


35 


GRVKDQIAQL 


36 


GVRDQVSWAL 


37 


ESVREQVMKY 


38 


SVRSQISASL 


39 


GVRETVYRHM 


40 


GVREVIVMHML 


41 


GRVRDQIWAAL 


42 


AGVRDQILIWL 


43 


GRVRDQIMLSL 


44 


GRVRDQI(X)„L 


45 


CTLRQWLQGC 


46 


CTLQEFLEGC 


47 


CTRTEWLHGC 


48 


CTLREWLHGGFC 


49 


CTLREWVFAGLC 


50 


CTLRQWLILLGMC 


51 


CTLAEFLASGVEQC 


52 


CSLQEFLSHGGYVC 


53 


CTLREFLDPTTAVC 


54 


CTLKEWLVSHEVWC 


55 


CTLREWL(X) 2 . 6 C 


56-60 


REGPTLRQWM 


61 


EGPTLRQWLA 


62 


ERGPFWAKAC 


63 


REGPRCVMWM 


64 


CGTEG PTLSTWLDC 


65 
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CEQDGPTLLEWLKC 


66 


CELVGPSLMSWLTC 


67 


CLTGPFVTQWLYEC 


68 


CRAGPTLLEWLTLC 


69 


CADGPTLREWISFC 


70 


C(X) i . ? EGPTLREWL(X) 1 . 2 C 


71-74 


GGCTLREWLHGGFCGG 


75 


GGCADGPTLREWISFCGG 


76 


GNADGPTLRQWLEGRRPKN 


77 


LAIEGPTLRQWLHGNGRDT 


78 


H G R VG PTLR E WKTQ VATKK 


79 


TIKGPTLRQWLKSREHTS 


80 


IS DG PTLKEWLS VTRG AS 


81 


SIEGPTLREWLTSRTPHS 


82 
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Table 7 — G-CSF-mimetic peptide sequences 



Sequence/structure 


SEQ 




yy 


EEDCK 


99 


1 




i— r— i— \ i/ 

EEDgK 


100 




1 nn 

JLUU 


1 

EEDaK 


100 


pGluEDaK 


101 


pGluEDcK 
1 


101 


pGluEDcrK 


101 


PicSDaK 


102 


PicSDaK 


102 


1 

PicSDaK 


102 


EEDCK-A-EEDCK 


103 


EEDXK-A-EEDXK 


104 
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Table 8 — TNF-antagonist peptide sequences 



Sequence/structure 


SEO 




TF> MO- 
IL-/ JAI W. 




lUO 




1U/ 


YUr 1 liotNnUY 


lUo 


CO A OCMLIPV 

rOAoblNlnOY 


Ivy 


YUAotlMnUY 


ill) 


CPMCCMUPV 


1 -1 -i 
111 


rUNotNnUY 


112 


CPMC\/rMDPV 

r O JNJ o V nlN riL/ Y 


llo 


vrcnc\/CMr\Pc 
YuoUbvoNUbr 


1 "l A 

114 


FCVSNDRCY 


115 


YCRKELGQVCY 


116 


YCKEPGQCY 


117 


YCRKEMGCY 


118 


FCRKEMGCY 


119 


YCWSQNLCY 


120 


YCELSQYLCY 


121 


YCWSQNYCY 


122 


YCWSQYLCY 


123 


DFLPHYKNTSLGHRP 


1085 


AA 1 -AB 1 

\ 

AC 

/ 

AA^-AB, 


NR 
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Table 9^Integrin-binding peptide sequences 



Sequence/structure 


SEQ 




TD NO- 


py FTy wy 

ri/x^C- i /\ 2 vv/\ 3 






/MO 


rivU L-/ v3 A 


4fc4fcD 


V-/ Pi Vjl \Ji A. O 


AAA 


py y pi r>Y y p 


AAR 


UnnnLUnrU 






AA7 


y y y Rf^ny y y 

A 1 A 2 A 3 M ^ UA 4 A 5 A 6 


/IAS 


py PRf^npy p 


AAQ 


pnPR^nPFP 




pnpRpnpi p 


AR1 


pi pRf^nniP 


AR9 


y y nny yyy 


AR^ 
4K20 


yyy nny y y y y 

A 1 A 2 A 3 U U A 4 A s A 6 A 7 A 8 


ARA 


HWRDRWI P 

V_/VVI— 'L/UVV L_ W 


A^ 


pwnm ww i p 

V_/ VVUULVV VV I— V_/ 


AR6 


nwnn^i mp 




pwdhrwmp 


ARR 


uOVV UUuVV LU 


ARQ 


ppnni wwi p 

wrUULVV VV 1_0 


AfS\ 
4fcoU 




NTT? 


pc;i 


NTT? 




NTT? 




1 071 


PMPRPX/QPPAPRP 


1 079 


PI c?PQI QP 


1 07^ 


Rf^n 


NTT? 


MflR 


NTT? 




NTT? 


i N \z* nnnn 


107A 


PNHRP 


107S 

JLU/ O 


PDPRHDPFP 


J-U/ u 


pn^i \/rp 


1 077 

JLU/ / 


DLXXL 


1043 


RTDLDSLRTYTL 


1044 


RTDLDSLRTY 


1053 


RTDLDSLRT 


1054 


RTDLDSLR 


1078 


GDLDLLKLRLTL 


1079 


GDLHSLRQLLSR 


1080 


RDDLHMLRLQLW 


1081 


SSDLHALKKRYG 


1082 


RGDLKQLSELTW 


1083 


RGDLAALSAPPV 


1084 
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Table 10 — Selectin antagonist peptide sequences 



S e quence/ structure 


SEQ 




ID NO: 




147 


Ul 1 VV L/C Lvv rxllVIIN 


14ft 


hvt\a/ppi whmmo 

U I 1 VV rCL vVL/IVllVIV^ 


14Q 


OITWAPil WMMMk' 
vji 1 V VnULVV INlVllVlrX 


1 


Ulvl 1 VVnlJLVV 1 LIVIo 


1^1 


U YoVV nULVVcMlVIo 


1 R9 


tzl 1 VVUVJLVV tZVlVIlN 


1 


W\/Q\A/POI WniMM 
il VoVVlZWLVVLJllVllN 


1 ^4. 


rll 1 VVUL>JL-vvriHVI I 




ruM IVIoVVI_PLVVcniVlr\ 


1 ^ft 


AC\A/T\A/nni \a/w\/mmpapqo 


1 


UIRAP\A/1 Al WPPiMQP 
rlriAtZVVLALVV n.\J. IVIOr 


1 Rft 

IvJO 


k'k'PnWI Al WRIMQV 
r\r\C U VV LAL V V rt 1 ivl O V 


1 RQ 


iTwnoi Wni hAW 


160 


Ul I VV U/VoJLVV L/L-IVIr\ 


161 


ui i vv LJV0JL.VV uuivir\ 




Ul 1 VV LJVoJLVVUl-IVIrx 


163 


nONRYTDL VAIONKNF 

V - /V_>< 1 M 1 ill l_> 1 , V / V J V-x 1 ^ I X. 1 M 1 — 


462 


AENWADNEPNNKRNNED 


463 


RKNNKTWTWVGTKKALTNE 


464 


KKALTN EAENWAD 


465 


CQXRYTDLVA1QNKXE 


466 


RKXNXXWTWVGTXKXLTEE 


467 


AENWADGEPNNKXNXED 


468 


CXXXYTXLVAIQN KXE 


469 


RKXXXXWXWVGTXKXLTXE 


470 


AXNWXXXEPNNXXXED 


471 


XKXKTXEAXNWXX 


472 
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Table 11 — Antipathogenic peptide sequences 



Sequence/structure 


SEQ 
TO NO- 


HPPAI IPK'IIQQPI PKTI 1 CiAVrSQAI QQQ^f^OPi 




r^ppAI IPKIIQCiPl PKTI 1 ^lAVnciAl Qooor^OP 
CirrnLlri\uOOrLrr\ 1 LLOn V vjOMLOoOv^l V3vj{lZ 




f^PPAl IPk'IIQQPI PK'TI I QA\/ 


DUO 


f^PPAl IDKIIQQPI PK'TI 1 QAV 
UrrnLlrrvllOOr Lrl\ 1 LLOMv 


OVJO 


1/TiPPAI IDKIIQQPI PKTI 1 CAW 


R07 


|/|/ficpA| IDk'IIQQPI PKTI 1 QA\/ 
i\r\urrnLiri\iioorLrf\ i llom v 




K'kY^PPAl IDKIIQQPI PK'TI 1 QAW 


^OQ 


OPPAI IDKI1Q 


n 

D1U 


r±\niA\/\ Lf\/i TTf^l PAI lQVV/IKRK'ROPk 


Oil 


Ol^AVLrxvL- 1 I ALIO v V irvrarvliWw 


^19 


ri\riA\f\ KVI TT^il PAI IQWIl^Pk'Pnn 


OiO 


narzA\f\ w\t\ tt^i pai iqwik'r 


Di4t 


AVI KA/1 TTf^l PAI IQ\A/lk'R 
AVi_r\vi_i I oLr^ALiovvii\n 


OiO 


kl 1 1 1 1 K'l 1 1 1 K 
iN-l—L-Ll—Lrxl— l_L.L.r\ 


o±o 


Ufl 1 1 VC\ 1 1 Wl 1 W 


01/ 


K'l 1 1 kl Wt K'l 1 K 
r\LLLr\Lr\Lr\LLt\ 




WVC\ 1 k'l k'l vc\ wvc 


01:7 


k'l i i wi i i k'l i k' 




ki i i k'l vc\ w\ i k' 

rxl_ I— L. r\ l_ r\ L r\L_ L r\ 


R91 


K'l 1 1 1 k' 
t\l-L.L_I_r\ 


^99 

3ZZ 


K'l 1 1 k'l 1 K' 


c 9^ 

OZO 


k"i 1 1 k"i w w i k* 


R.9A 

□Z*± 


k'l 1 1 k'l K'l Kl 1 


OZO 


K'l 1 1 k'l K\ K'l 1 K 


^9£ 
DZO 


k' A A A k" A A A k' A A K 




k'\AA/k'\AA/K\A/k' 
r\V V V r\v vVrvVVrx 


1^9 ft 


kAAA/kA/KA/KAA/K 
rx V V V r\ V r\ V r\ V V r\ 


t^9Q 


KAA A / kV K\ / K\ / k' 
r\ V V Vr\VI\Vl\vl\ 




r\vvvr\Vr\vKvVr\ 


DOl 


k'l II k'l 


^^9 


KA/I Ml 1 
r\ v LriL.L. 




l_r\L_riL-L. 




k'PI Ml 1 




k'l li i^i \/r 




KVFHI 1 HI 

l\V 1 1 1 L_L_ 1 I L_ 


c ^7 


HKFRILKL 


538 


KPFHILHL 


539 


KIIIKIKIKIIK 


540 


KIIIKIKIKIIK 


541 


KIIIKIKIKIIK 


542 


KIPIKIKIKIPK 


543 


KIPIKIKIKIVK 


544 


RIIIR1RIRIIR 


545 


RIIIRIRIRIIR 


546 


RIIIR1RIRIIR 


547 
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RIVIRIRIRLIR 


548 


RIIVRIRLRIIR 


549 


RIG1RLRVRIIR 


550 


KIV1RIRIRLIR 


551 


RIAVKWRLRFIK 


552 


KIGWKLRVRIIR 


553 


KKIGWL1IRVRR 


554 


RIVIRIRIRLIRIR 


555 


RIIVRIRLRIIRVR 


556 


RIGIRLRVRIIRRV 


557 


KIVIRIRARLIRIRIR 


558 


RIIVKIRLRIIKKIRL 


559 


KIGIKARVRIIRVKII 


560 


RIIVHIRLRIIHHIRL 


561 


HIGIKAHVRIIRVHI1 


562 


RIYVKIHLRYIKKIRL 


563 


KIGHKARVHIIRYKII 


564 


RIYVKPHPRYIKKIRL 


565 


KPGHKARPH1IRYKII 


566 


KIVIRIRIRL1RIRIRKIV 


567 


RIIVKIRLRI1KKIRLIKK 


568 


KIGWKLRVRI1 RVKIGRLR 


569 


KIVIRIRIRLIRIRIRKIVKVKRIR 


570 


RFAVKIRLRIIKKIRLIKKIRKRVIK 


571 


KAGWKLRVRIIRVKIGRLRKIGWKKRVRIK 


572 


RIYVKPHPRYIKKIRL 


573 


KPGHKARPH1IRYKII 


574 


KIVIRIRIRLIRIRIRKIV 


575 


RIIVKIRLRIIKKIRLIKK 


576 


RIYVSKISIYIKKIRL 


577 


KIVIFTRIRLTSIRIRSIV 


578 


KP 1 H KAR PT1 1 R YKM 1 

1 \l II II \ 1 || | ill 1 | 1 \l VII 


579 


cvclicCKGFFALIPKIISSPLFKTLLSAVC 


580 


CKKGFFALIPKIISSPLFKTLLSAVC 


581 


CKKKGFFALIPKIISSPLFKTLLSAVC 


582 


CyclicCRIVIRIRIRLIRIRC 


583 


CyclicCKPGHKARPHIIRYKIIC 


584 


CyclicCRFAVKIRLRIIKKIRLIKKIRKRVIKC 


585 


KLLLKLLL KLLKC 


586 


KLLLKLLLKLLK 


587 


KLLLKLKLKLLKC 


588 


KLLLKLLLKLLK 


589 
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Table 12 — VIP-mimetic peptide sequences 





ID J\l O: 


HSDAVFYDNYTR LRKQMAVKKYLN SILN 


590 


Nle HSDAVFYDNYTR LRKQMAVKKYLN SILN 


591 


✓x^ ✓x 1 A 2 


592 


X a S X d LN 


593 


NH CH CO KKYX5 NH CH CO X6 


594 


1 1 




KKYI 




MQII M 

INOIL-IN 




KKYI 
r\rv T 1— 


^7 


r\r\ Y M 


oyo 


A\/KKYf 


oyy 


iNOll— IN 


600 


kkyv 

PvPv T V 


601 


Oil oi iM 
OlL-dUIN 


609 


kkyi Mio 




IN.O T 1— IN 


60d 
out 


INOl T IN 


60^ 


KKYI PPMQII N 


606 


1 ai iKKYI 
LdUi\r\ T L. 


Uu/ 


PnnKKYI 


fins 


KYI 

l\ 1 L 




KKYNIIf* 
r\r\ T IN It; 


609 


VKKYI 
v r\r\ t i— 


610 


1 MQII M 

L_INOll_IN 


611 

OX Jl 


Yl M<^ll M 

T LlNOJl_IN 


619 


KKYI Nl 
r\r\ i L.1N 


61^ 

(JXt»? 


KKYI MQ 

PviX I LIN O 


614 


r\rv T LINOI 


61 S 


rxrx T LINO l L. 


616 


KKYI 
r\r\ t u. 


617 


KKYDA 
r\rv i Ur\ 


oxo 


AX/KKVI 
/A V rvrv T L 


61 Q 


INOIL.IM 


690 


KKYV 
r\r\ t v 


691 


SILauN 


622 


NSYLN 


623 


NSIYN 


624 


KKYLNIe 


625 


KKYLPPNSILN 


626 


KKYL 


627 


KKYDA 


628 


AVKKYL 


629 


NSILN 


630 


KKYV 


631 


SILauN 


632 
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LaUl\r\Y L 


ODD 


Oapi\l\YL 




k'Vi 
r\ YL 


NTT? 


rvYL 


INK 


r\r\Y INie 


ODD 


Vr\r\YL 


OOO 


LIN ol LIN 


OO/ 


YLNSILN 


638 


KKYLNIe 


639 


KKYLN 


640 


KKYLNS 


641 


KKYLNSI 


642 


KKYLNS1L 


643 


KKKYLD 


644 


cyclicCKKYLC 


645 


CKKYLK 

I ) 


646 


I 1 
S-CH^-CO 




l\i\YA 


04/ 


\fl/\A/TRTr i 1 \A/ 

WW 1 U 1 bLW 


04fco 


\A/\ft/TnPiPI \A/ 

WW 1 UUbLW 


o4y 


WWU 1 HlaLWVW 1 1 


ooU 


r-\ A/r>Kipi/^l\A/l ETO/^ 

rWtaNlJCslWLfcota 


odI 


UvVIJUr'bLVVnbAA 


ODZ 


DXAAP^FlMP 1 \A/\/\/\/l 

riWUUINvaLVV V V VL 


oDo 




DD4 


ri VV UUAu LVV V A 


ODD 


uri xA/oizooixA/ft/ir^iz 

r\LVvotU}lalVV Motz 


ODD 


O Wo fvi n o L W LU 


OD/ 


O HA/HMTP |\A/\ /DP 


ODo 


UWU 1 rltaLWV Y 


ODy 


oLWUtNbAWI 


ooU 


l\WUUnbLVVIvln 


DDI 


OAWMFRftl \A/T 


DDZi 


QWDTRGLWVA 


663 


WNVHGIWQE 


664 


SWDTRGLWVE 


665 


DWDTRGLWVA 


666 


SWGRDGLWIE 


667 


EWTDNGLWAL 


668 


SWDEKGLWSA 


669 


SWDSSGLWMD 


670 
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Table 13 — Mdm/hdm antagonist peptide sequences 



Sequence/structure 


SEQ 




ID NO- 


1 1 OL/I— vv 


iou 


WC I rOL/Lvvr\LLr 


1 31 

lOl 


HPTFQni WKI 1 P 


139 
lOZ 


nFTFQnVWk"! 1 P 
\4tZ I roLJ Y VVr\l_l_n 


1 33 
loo 


nPTFQnv\A/k r i i p 


1 3A 


IVIr^rirlVlLJ Y VvtvjtLlM 


i 3c; 
loo 


\ /P> M P 1 R V\ A/TPi O P 
vuiMrlUY vv I vjkjr 


1 3£ 
lOD 


TG PAFTH YWATF 


137 


IDRAPTFRDHWFALV 


138 


PRPALVFADYWETLY 


139 


PAFSRFWSDLSAGAH 


140 


PAFS RF WSKLS AG AH 


141 


PXFXDYWXXL 


142 


QETFSDLWKLLP 


143 


QPTFSDLWKLLP 


144 


Q ETFSD YWKLLP 


145 


QPTFSDYWKLLP 


146 


Table 14 — Calmodulin antagonist peptide sequences 


S equence/structure 


SEQ 




ID NO: 


SCVKWGKKEFCGS 


164 


SCWKYWGKECGS 


165 


SCYEWGKLRWCGS 


166 


SCLRWGKWSNCGS 


167 


SCWRWGKYQICGS 


168 


SCVSWGALKLCGS 


169 


SCIRWGQNTFCGS 


170 


SCWQWGNLKICGS 


171 


SCVRWGQLSICGS 


172 


LKKFNARRKLKGAILTTMLAK 


173 


RRWKKNFIAVSAANRFKK 


174 


RKWQKTGHAVRAIGRLSS 


175 


INLKALAALAKKIL 


176 


KIWSILAPLGTTLVKLVA 


177 


LKKLLKLLKKLLKL 


178 


LKWKKLLKLLKKLLKKLL 


179 


AEWPSLTEIKTLSHFSV 


180 


AEWPSPTRVISTTYFGS 


181 


AELAHWPPVKTVLRSFT 


182 


AEGSWLQLLNLMKQMNN 


183 


AEWPSLTEIK 


184 



5 
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Table 15 — Mast cell antagonists/Mast cell protease inhibitor 



peptide sequences 



S equence/structure 


SEQ 




ID NO: 


SGSGVLKRPLPILPVTR 


272 


RWLSSRPLPPLPLPPRT 


273 


GSGSYDTLALPSLPLHPMSS 


274 


GSGSYDTRALPSLPLHPMSS 


275 


GSGSSGVTMYPKLPPHWSMA 


276 


GSGSSGVRMYPKLPPHWSMA 


277 


GSGSSSMRMVPTIPGSAKHG 


278 


RNR 


.NR 


QT 


NR 


RQK 


NR 


NRQ 


NR 


RQK 


NR 


RNRQKT 


436 


RNRQ 


437 


RNRQK 


438 


NRQKT 


439 


RQKT 


440 
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Table 16 — SH3 antagonist peptide sequences 



S e a uence/str ucture 


SEQ 




TD MO- 


DDI DDI D 

nrLrrLr 


9£9 
ZOZ 


nri DDI D 

nbLrrLr 


981 
ZO\j 


ODI DDI D 

oPLPPLP 


Zo4 


ODI DDI D 

CaPLPPLP 


9Q£ 
ZOO 


DDI DIDD 
KPLPIPP 


ZoO 


nni nmn 

RPLPIPP 


ZoY 


i — 1 1 — > i n nT n 

RRLPPTP 


Zoo 


RULPP 1 P 


9QQ 

Zo? 


DDI DODD 

RPLPSRP 


9QH 

zyu 


DDI DTDD 
RPLP 1 RP 


9Q1 

zyi 


SRLPPLP 


9Q9 

zyz 


RALPbPP 


9Q1 

zyo 


nm nnTn 

RRLPRTP 


zy4 


DDX/DDIT 
HPVPPI 1 


9QR 
AyO 


ii a nn\ /r~» 

I LAPP VP 


9Q£ 

zyo 


RPLPMLP 


9Q7 

Ay/ 


nni nil n 

RPLPILP 


9QQ 
Zyo 


nni noi n 

RPLPSLP 


9QQ 

zyy 


nni dcm n 

RPLPSLP 


inn 


nni D1V/IID 
RPLPMIP 




DDI DI ID 

RPLPLIP 


ifi9 
olIZ 


DDI DDTD 
RPLPP 1 P 


ini 


DCI DDI D 

RoLPPLP 


10A 
OU4fc 


n nnnnnn 
RPUPPPP 




D/"\l DIDD 

RULPIPP 


DUO 


vwnni nni dvo 
AAAHPLPPLPaP 


aU/ 


VWDDI DDIDW 

aaaRPLPPIPaa 




WVDDI DDI DW 

XaaRPLPPLPaX 




DVVDDI DDI DVD 

HaaRPLPPLPaP 




RYYRP1 PP1 PPP 
nAAnrLrrLrrr 


^11 


PPPYPPPPIPXX 


312 


PPPYPPPPVPXX 


313 


LXXRPLPXW 


314 


TXXRPLPXLP 


315 


PPX9XPPP¥P 


316 


+PPWXKPXWL 


317 


RPXTPTR+SXP 


318 


PPVPPRPXXTL 


319 


YPTLPYK 


320 


+0DXPLPXLP 


321 
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Table 17 — Somatostatin or cortistatin mimetic peptide sequences 



Sequence/structure 


SEQ 
TD WO 


Y 1 -Y 2 _A<sn-Php»-Php-Trn-l v<?-Thr-Php-X 3 -9pr-Y 4 
s\ s\ aaoi i t i itJTi it? i i jj Lyo i i ii ii ie /v oel y\ 


A71 


Aen Am Mot P rn ("Ivo Arn Aon Pho Pho Trn 1 \/q Thr Pho Qor Qor f^\/c 1 \/o 
no[J Ml y IvIeL i III Uyo MI JJ Mol 1 "lie i lie 1 ip Lyo 1 III "lit? Ocl GUI uyo Lyo 


A7A 


l\/Iot Pm r*S/o Arn Aon Dho Pho Trn 1 \/o Thr Pho Oz-\ O^n* r*S/o I \/o 
IVIeL rfU Uyo Miy Moll "Me "lie 1 l\J Lyo 1 III "lit; Ocl Oel Uyo Lyo 


A7K 


r^\/c A rrt Aen Pho Pha Trn I \/o Thr Pho Cor Qor 0\ /o 1 wo 
Uyo Miy Moll rile n 1 tc lip Lyo 1 III "lie Oel Oel Uyo Lyo 


A7& 


Aon Arn l\/Iot Prn i^/o Arn Aen Pho Pho Trn 1 \/o Thr Pho Qor Qor PS/o 
Mop Miy IVIeL rfU Uyo Miy Moll r lit? "lie 1 ip Lyo 1 In "lie Ocl ocl Uyo 


A77 


Mot Prn r^wo Arn Aon Pho Pho Trn 1 \/o Thr Pho Oar Cor rN/o 
IVIeL rfU Uyo Miy Moll "lie "lie 1 ip Lyo 1 III "lit? Oel Ocl Uyo 


A7R 


Owe Arn Aon Pho Pho Trn 1 \/o Thr Pho Qor Qor i\/o 
Uyo Miy Moll "lie "He 1 ip Lyo 1 III r lie Oel Oel Uyo 


A7Q 


Aon Arn h/lot Prn Owe 1 \/o Aon Pho Pho Trn 1 wo Thr Pho Qor Qor f"*wo 
Mop Miy IVJeL "I <J Uyo Lyo Moll "lie rile 1 ip Lyo 1 III "lie Oel Oel Uyo 


ARC) 


ly/lot Prn r"S/o 1 \/o Aon Pho Pho Trn 1 \/o Thr Pho Qor Qor i"*wo 1 wo 
IVJeL rlU Uyo Lyo Moll i lie "lie 1 IjJ Lyo t III "lie Oel Oel Uyo Lyo 


AR1 


Owo 1 \ /o Aon Pho Pho Trn 1 \ /o Thr Pho Qor Qor /o I \ /o 
Uyo Lyo Moll "lie "lie 1 l[J Lyo 1 III "lie Oel Oel Uyo Lyo 




Aon Am rAai: Prn t"*wo I \/o Aon Pho Pho Trn I \ie Thr Pho Qor Qor C^\ic 
Mop Miy IVIeL rlu Uyo Lyo Mom "lie rile l Tp Lyo 1 III "lie Oel Oel Uyo 


AR^ 


IVVlot Prn i"*\/o 1 \/o Aon Pho Pho Trn 1 \/o Thr Pho Qor Qor P\/o 
IVIeL rlU Uyo Lyo Moll "lie "lie 1 ip Lyo 1 III "lie Oel Oel Uyo 


A9KA 


P\/o 1 \/o Aon Pho Pho Trn 1 \/o Thr Pho Qor Qor i^i/o 
Uyo Lyo Moll "lie "lie 1 ip Lyo 1 Mi "lie Oel Oel Uyo 


ARK 


Aon Arn IV/lot Prn Owe Arn Aon Pho Pho Trn 1 \/o Thr Pho Thr Qor Owe 1 \/e 
Mop Mi y IVIel rlu Uyo MI y Mol 1 "lie "I le lip Lyo 1 I II i 1 le I I II Oel Uyo Lyo 




Mpt Prn Dvci Am A^n Php Php Trn 1 V9. Thr Php Thr 9pr Hv^ 1 v«i 

ivieL i i kj uyo v\i y / vo 1 1 n ic rue i i \~> i — y o iiii riie iiii oei v_/y o t_yo 


487 


Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


488 


Asp Arg Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


489 


Met Pro Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


490 


Cys Arg Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


491 


Asp Arg Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


492 


Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


493 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys Lys 


494 


Asp Arg Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


495 


Met Pro Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


496 


Cys Lys Asn Phe Phe Trp Lys Thr Phe Thr Ser Cys 


497 



-56- 



WO 01/83525 



PCT/US01/14310 



Table 18 — UKR antagonist peptide sequences 



S p a 11 pn c p/stru c tu rp 


SEO 






AbrMrnoLNroUYLWY 1 


1 

l^O 


Abn 1 YbbLWD 1 YorLAr 


1 07 

±y/ 


a r— i r-v 1 \A/|\/ir~)LJ\/DI OCTOMD 

AELDLWMRHYPLbFbNR 


i no 


AESSLWTRYAWPbMPbY 




AC\ftfUn/^l nrr> 0\/l \ A /C'l/T 

AEWHPGLSFGbYLWbKT 


200 


ai — dai i Mu/occrMnni 1 1 

AEPALLN Wo F F FN PG LH 


oai 

201 


AE W S F YN LH LP EPQT1 F 


O AO 

202 


a r~ r - * i r-vi \a/oi \/oi nni ah 

AEPLDLWSLYSLPPLAM 


o ao 

203 


AEPTLWQLYQFPLRLSG 


OA/1 

204 


AEISFSELMWLRSTPAF 


o nn 

205 


AELSEADLWTTWFGMGS 


OA/T 

206 


A CIO CM \AADICCDOAI IVJlMC 

AtooLVVHIrbroALIViMb 


ZO/ 


AESLPTLTSILWGKESV 


208 


AETLFMDLWHDKH1LLT 


209 


AEILNFPLWHEPLWSTE 


210 


AESQTGTLNTLFWNTLR 


211 


AEPVYQYELDSYLRSYY 


430 


AELDLSTFYD IQ YLLRT 


431 


AEFFKLGPNGYVYLHSA 


432 


FKLXXXGYVYL 


433 


AESTYH H LS LG YM YTLN 


434 


YHXLXXGYMYT 


435 
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Table 19 — Macrophage and/or 



T-cell inhibiting peptide sequences 



Sequence/structure 


SEQ 




ID NO: 


Xaa-Yaa-Arg 


NR 


Arg-Yaa-Xaa 


NR 


Xaa-Arg-Yaa 


NR 


Yaa-Arg-Xaa 


NR 


Ala~Arg 


NR 


Arq-Arq 


NR 


Asn-Arg 


NR 


Asp-Arg 


NR 


Cys-Arg 


NR 


Gin-Arg 


NR 


Glu-Arg 


NR 


Gly-Arg 


NR 


His-arg 


NR 


iie-Arg 


NR 


Leu~Arg 


NR 


Lys-Arg 


* NR 


Met-Arg 


NR 


Phe-Arg 


NR 


Ser-Arg 


NR 


Thr-Arg 


NR 


Trp-Arg 


NR 


Tyr-Arg 


NR 


Val-Arg 


NR 


Ala-Glu-Arg 


NR 


Arg-Glu-Arg 


NR 


Asn-Glu-Arg 


NR 


Asp-GIu-Arg 


NR 


Cys-Glu-Arg 


NR 


GIn-Glu-Arg 


NR 


Glu-Glu-Arg 


NR 


Gly-GIu-Arg 


NR 


His-Glu-Arg 


NR 


lle-Glu-Arg 


NR 


Leu-Glu-Arg 


NR 


Lys-Glu-Arg 


NR 


Met-Glu-Arg 


NR 


Phe-Glu-Arg 


NR 


Pro-Glu-Arg 


NR 


Ser-Glu-Arg 


NR 


Thr-Glu-Arg 


NR 


Trp-Glu-Arg 


NR 


Tyr-Glu-Arg 


NR 


Val-Glu-Arg 


NR 


Arg-Ala 


NR 
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Arg-Asp 


NR 


Arg-Cys 


NR 


Arg-GIn 


NR 


Arg-Glu 


NR 


Arg-Gly 


NR 


Arg-His 


NR 


Arg-IIe 


NR 


Arg-Leu 


NR 


Arg-Lys 


NR 


Arg-Met 


NR 


Arg-Phe 


NR 


Arg-Pro 


NR 


Arg-Ser 


NR 


Arg-Thr 


NR 


Arg-Trp 


NR 


Arg-Tyr 


NR 


Arg-Va! 


NR 


Arg-Glu-Ala 


NR 


Arg-Glu-Asn 


NR 


Arg-Glu-Asp 


NR 


Arg-Glu-Cys 


NR 


Arg-Glu-GIn 


NR 


Arg-Glu-Glu 


NR 


Arg-Glu-Gly 


NR 


Arg-Glu-His 


NR 


Arg-Glu-Ile 


NR 


Arg-Glu-Leu 


NR 


Arg-Glu-Lys 


NR 


Arg-Glu-Met 


NR 


Arg-Glu-Phe 


NR 


Arg-Glu-Pro 


NR 


Arg-Glu-Ser 


NR 


Arg-Glu-Thr 


NR 


Arg-Glu-Trp 


NR 


Arg-Glu-Tyr 


NR 


Arg-Glu-Val 


NR 


Ala- Arg-Glu 


NR 


Arg-Arg-Glu 


NR 


Asn-Arg-Glu 


NR 


Asp-Arg-Glu 


NR 


Cys-Arg-Glu 


NR 


Gln-Arg-Glu 


NR 


Glu-Arg-Glu 


NR 


Gly-Arg-Glu 


NR 


His-Arg-Glu 


NR 


lle-Arg-Glu 


NR 


Leu-Arg-GIu 


NR 


Lys-Arg-Glu 


NR 


Met-Arg-Glu 


NR 


Phe-Arg-Glu 


NR 
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Pro-Arg-Glu 


NR 


Ser-Arg-Glu 


NR 


Thr-Arg-Glu 


NR 


Trp-Arg-Glu 


NR 


Tyr-Arg-Glu 


NR 


Val-Arg-Glu 


NR 


Glu-Arg-Ala, 


NR 


GIu-Arg-Arg 


NR 


GIu-Arg-Asn 


NR 


GIu-Arg-Asp 


NR 


Glu-Arg-Cys 


NR 


Glu-Arg-GIn 


NR 


Glu-Arg-Gly 


NR 


Glu-Arg-His 


NR 


Glu-Arg-lle 


NR 


G!u-Arg-Leu 


NR 


GIu-Arg-Lys 


NR 


Glu-Arg-Met 


NR 


Glu-Arg-Phe 


NR 


Glu-Arg-Pro 


NR 


Glu-Arg-Ser 


NR 


Glu-Arg-Thr 


NR 


Glu-Arg-Trp 


NR 


Glu-Arg-Tyr 


NR 


Glu-Arg-Val 


NR 
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Table 20 — Additional Exemplary Pharmacologically Active Peptides 



Secfuence/stnicture 


SEQ 
ID 
NO: 


Activity 


VEPNCDIHVMWEWECFERL 


1027 


VEGF-antagonist 


GERWCFDGPLTWVCGEES 


1084 


VEGF-antagonist 


RGWVEICVADDNGMCVTEAQ 


1085 


VEGF-antagonist 


GWDECDVARMWEWECFAGV 


1086 


VEGF- antagonist 


GERWCFDGPRAWVCGWEI 


501 


VEGF- antagonist 


EELWCFDGPRAWVCGYVK 


502 


VEGF- antagonist 


RGWVEICAADDYGRCLTEAQ 


1031 


VEGF- antagonist 


RGWVEICESDVWGRCL 


1087 


VEGF- antagonist 


RGWVEICESDVWGRCL 


1088 


VEGF- antagonist 


GGNECDIARMWEWECFERL 


1089 


VEGF- antagonist 


RGWVEICAADDYGRCL 


1090 


VEGF-antagonist 


CTTHWGFTLC 


1028 


MMP inhibitor 


CLRSGXGC 


1091 


MMP inhibitor 


CXXHWGFXXC 


1092 


MMP inhibitor , 


CXPXC 


1093 


MMP inhibitor 


CRRHWGFEFC 


1094 


MMP inhibitor 


STTHWGFTLS 


1095 


MMP inhibitor 


CSLHWGFWWC 


1096 


CTLA4-mimetic 


GFVCSGIFAVGVGRC 


125 


CTLA4-mimetic 


APGVRLGCAVLGRYC 


126 


CTLA4-mimetic 


LLGRMK 


105 


Antiviral (HBV) 


ICVVQDWG HH RCTAGHMANLTSHASAI 


127 


C3b antagonist 


1CVVQDWGHHRCT 


128 


C3b antagonist 


CVVQDWGHHAC 


129 


C3b antagonist 


STGGFDDVYDWARGVSSALTTTLVATR 


185 


Vinculin-binding 


STGG FDD VYDWARR VSSALTTTLVATR 


186 


Vinculin-binding 


SRGVNFSEWLYDMSAAMKEASNVFPSRRSR 


187 


Vinculin-binding 


SSQNWDMEAGVEDLTAAMLGLLSTIHSSSR 


188 


Vinculin-binding 


SSPSLYTQFLVNYESAATRIQDLLIASRPSR 


189 


Vinculin-binding 


SSTGWVDLLGALQRAADATRTSIPPSLQNSR 


190 


Vinculin-binding 


DVYTKKELIECARRVSEK 


191 


Vinculin-binding 


EKGSYYPGSGIAQFHIDYNNVS 


192 


C4BP-binding 


SG 1 AQ FH 1 D YN N VSS AEG WH VN 


193 


C4BP-binding 


LVTVEKG S YYPG SG 1 AQ FH 1 D YNN VSS AEG WH VN 


194 


C4BP-binding 


SGIAQFHIDYNNVS 


195 


C4BP-binding 


LLGRMK 


279 


anti-HBV 


ALLGRMKG 


280 


anti-HBV 


LDPAFR 


281 


anti-HBV 


CXXRGDC 


322 


Inhibition of platelet 
aggregation 


RPLPPLP 


323 


Src antagonist 


PPVPPR 


324 


Src antagonist 


XFXDXWXXLXX 


325 


Anti-cancer 
(particularly for 
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sarcomas) 


KACRRLFGPVDSEQLSRDCD 


326 


p16-mimetic 


RERWNFDFVTETPLEGDFAW 


327 


p16-mimetic 


KRRQTSMTDFYHSKRRLIFS 


328 


p16-mimetic 


TSMTDFYHSKRRLIFSKRKP 


329 


p16-mimetic 


RRUF 


330 


p16-mimetic 


KRRQTSATDFYHSKRRLIFSRQIKIWFQNRRMKWKK 


331 


p16-mimetic 


KRRLIFSKRQIKIWFQNRRMKWKK 


332 


p16-mimetic 


Asn Gin Gly Arg His Phe Gys Gly Gly Ala Leu lie His Ala 

A rr\ Pho \/ol [Mat "Thr Ala Ala Qar fS/o Phei f^iln 

Arg rne vai iviei i nr Mia Mia oer oys rne vjun 


a no 


UAro/ mimetic/Lrb 
uinuiny 


Am Hid Php f^k/ Ala 1 pfi Mp Hiq AIps Am PhA 
Miy nio ~i it? wyo \_^iy ly Mid t-c?u lit? nio Mid Miy r Me? vai 
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antiviral 
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CVHTPRS 
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antiviral 
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antiviral 


HWAWFK 


1140 


anti-ischemic, growth 
hormone-liberating 



The present invention is also particularly useful with peptides 
having activity in treatment of: 

• cancer, wherein the peptide is a VEGF-mimetic or a VEGF receptor 

5 antagonist, a HER2 agonist or antagonist, a CD20 antagonist and the 

like; 

• asthma, wherein the protein of interest is a CKR3 antagonist, an IL-5 
receptor antagonist, and the like; 

• thrombosis, wherein the protein of interest is a GPIIb antagonist, a 
1 0 GPIIIa antagonist, and the like; 

• autoimmune diseases and other conditions involving immune 
modulation, wherein the protein of interest is an IL-2 receptor 
antagonist, a CD40 agonist or antagonist, a CD40L agonist or 
antagonist, a thymopoietin mimetic and the like. 

15 Vehicles . This invention requires the presence of at least one vehicle 

(F 1 , F 2 ) attached to a peptide through the N-terminus, C-terminus or a 
sidechain of one of the amino acid residues. Multiple vehicles may also be 
used; e.g., Fc's at each terminus or an Fc at a terminus and a PEG group at 
the other terminus or a sidechain. 

2 0 An Fc domain is the preferred vehicle. The Fc domain may be fused 

to the N or C termini of the peptides or at both the N and C termini. For 
the TPO-mimetic peptides, molecules having the Fc domain fused to the N 
terminus of the peptide portion of the molecule are more bioactive than 
other such fusions, so fusion to the N terminus is preferred. 
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As noted above, Fc variants are suitable vehicles within the scope of 
this invention. A native Fc may be extensively modified to form an Fc 
variant in accordance with this invention, provided binding to the salvage 
receptor is maintained; see, for example WO 97/34631 and WO 96/32478. 
5 In such Fc variants, one may remove one or more sites of a native Fc that 
provide structural features or functional activity not required by the 
fusion molecules of this invention. One may remove these sites by, for 
example, substituting or deleting residues, inserting residues into the site, 
or truncating portions containing the site. The inserted or substituted 
10 residues may also be altered amino acids, such as peptidomimetics or D- 
amino acids. Fc variants may be desirable for a number of reasons, several 
of which are described below. Exemplary Fc variants include molecules 
and sequences in which: 

1. Sites involved in disulfide bond formation are removed. Such removal 
15 may avoid reaction with other cysteine-containing proteins present in 

the host cell used to produce the molecules of the invention. For this 
purpose, the cysteine-containing segment at the N-terminus may be 
truncated or cysteine residues may be deleted or substituted with other 
amino acids (e.g., alanyl, seryl). In particular, one may truncate the N- 
2 0 terminal 20-amino acid segment of SEQ ID NO: 2 or delete or 

substitute the cysteine residues at positions 7 and 10 of SEQ ID NO: 2. 
Even when cysteine residues are removed, the single chain Fc domains 
can still form a dimeric Fc domain that is held together non-covalently. 

2. A native Fc is modified to make it more compatible with a selected host 
2 5 cell. For example, one may remove the PA sequence near the N- 

terminus of a typical native Fc, which may be recognized by a digestive 
enzyme in E. coli such as proline iminopeptidase. One may also add an 
N-terminal methionine residue, especially when the molecule is 
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expressed recombinantly in a bacterial cell such as E. colL The Fc 
domain of SEQ ID NO: 2 (Figure 4) is one such Fc variant. 

3. A portion of the N-terminus of a native Fc is removed to prevent N- 
terminal heterogeneity when expressed in a selected host cell. For this 

5 purpose, one may delete any of the first 20 amino acid residues at the 

N-terminus, particularly those at positions 1, 2, 3, 4 and 5. 

4. One or more glycosylation sites are removed. Residues that are 
typically glycosylated (e.g., asparagine) may confer cytolytic response. 
Such residues may be deleted or substituted with unglycosylated 

10 residues (e.g., alanine). 

5. Sites involved in interaction with complement, such as the Clq binding 
site, are removed. For example, one may delete or substitute the EKK 
sequence of human IgGl. Complement recruitment may not be 
advantageous for the molecules of this invention and so may be 

1 5 avoided with such an Fc variant. 

6. Sites are removed that affect binding to Fc receptors other than a 
salvage receptor. A native Fc may have sites for interaction with 
certain white blood cells that are not required for the fusion molecules 
of the present invention and so may be removed. 

2 0 7. The ADCC site is removed. ADCC sites are known in the art; see, for 
example, Molec. Immunol . 29 (5): 633-9 (1992) with regard to ADCC 
sites in IgGl. These sites, as well, are not required for the fusion 
molecules of the present invention and so may be removed. 
8. When the native Fc is derived from a non-human antibody, the native 

25 Fc may be humanized. Typically, to humanize a native Fc, one will 

substitute selected residues in the non-human native Fc with residues 
that are normally found in human native Fc. Techniques for antibody 
humanization are well known in the art. 
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Preferred Fc variants include the following. In SEQ ID NO: 2 
(Figure 4) the leucine at position 15 may be substituted with glutamate; the 
glutamate at position 99, with alanine; and the lysines at positions 101 and 
103, with alanines. In addition, one or more tyrosine residues can be 
5 replaced by pheny alanine residues. 

An alternative vehicle would be a protein, polypeptide, peptide, 
antibody, antibody fragment, , or small molecule (e.g., a peptidomimetic 
compound) capable of binding to a salvage receptor. For example, one 
could use as a vehicle a polypeptide as described in U.S. Pat. No. 5,739,277, 

10 issued April 14, 1998 to Presta etal. Peptides could also be selected by 
phage display for binding to the FcRn salvage receptor. Such salvage 
receptor-binding compounds are also included within the meaning of 
"vehicle " and are within the scope of this invention. Such vehicles should 
be selected for increased half-life (e.g., by avoiding sequences recognized 

15 by proteases) and decreased immunogenicity (e.g., by favoring non- 
immunogenic sequences, as discovered in antibody humanization). 

As noted above, polymer vehicles may also be used for F 1 and F 2 . 
Various means for attaching chemical moieties useful as vehicles are 
currently available, see, e.g., Patent Cooperation Treaty ("PCT") 

2 0 International Publication No. WO 96/11953, entitled "N-Terminally 
Chemically Modified Protein Compositions and Methods/' herein 
incorporated by reference in its entirety. This PCT publication discloses, 
among other things, the selective attachment of water soluble polymers to 
the N-terminus of proteins. 

2 5 A preferred polymer vehicle is polyethylene glycol (PEG). The PEG 

group may be of any convenient molecular weight and may be linear or 
branched. The average molecular weight of the PEG will preferably range 
from about 2 kiloDalton ("kD") to about 100 kDa, more preferably from 
about 5 kDa to about 50 kDa, most preferably from about 5 kDa to about 
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10 kDa. The PEG groups will generally be attached to the compounds of 
the invention via acylation or reductive alkylation through a reactive 
group on the PEG moiety (e.g., an aldehyde, amino, thiol, or ester group) 
to a reactive group on the inventive compound (e.g., an aldehyde, amino, 
5 or ester group). 

A useful strategy for the PEGylation of synthetic peptides consists 
of combining, through forming a conjugate linkage in solution, a peptide 
and a PEG moiety, each bearing a special functionality that is mutually 
reactive toward the other. The peptides can be easily prepared with 

1 0 conventional solid phase synthesis (see, for example, Figures 5 and 6 and 
the accompanying text herein). The peptides are "preactivated" with an 
appropriate functional group at a specific site. The precursors are purified 
and fully characterized prior to reacting with the PEG moiety. Ligation of 
the peptide with PEG usually takes place in aqueous phase and can be 

1 5 easily monitored by reverse phase analytical HPLC. The PEGylated 

peptides can be easily purified by preparative HPLC and characterized by 
analytical HPLC, amino acid analysis and laser desorption mass 
spectrometry. 

Polysaccharide polymers are another type of water soluble polymer 
2 0 which may be used for protein modification. Dextrans are polysaccharide 
polymers comprised of individual subunits of glucose predominantly 
linked by al-6 linkages. The dextran itself is available in many molecular 
weight ranges, and is readily available in molecular weights from about 1 
kD to about 70 kD. Dextran is a suitable water soluble polymer for use in 
2 5 the present invention as a vehicle by itself or in combination with another 
vehicle (e.g., Fc). See, for example, WO 96/11953 and WO 96/05309. The 
use of dextran conjugated to therapeutic or diagnostic immunoglobulins 
has been reported; see, for example, European Patent Publication No. 0 
315 456, which is hereby incorporated by reference. Dextran of about 1 kD 
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to about 20 kD is preferred when dextran is used as a vehicle in 
accordance with the present invention. 

Linkers . Any "linker" group is optional. When present, its chemical 
structure is not critical, since it serves primarily as a spacer. The linker is 
5 preferably made up of amino acids linked together by peptide bonds. 
Thus, in preferred embodiments, the linker is made up of from 1 to 20 
amino acids linked by peptide bonds, wherein the amino acids are selected 
from the 20 naturally occurring amino acids. Some of these amino acids 
may be glycosylated, as is well understood by those in the art. In a more 
10 preferred embodiment, the 1 to 20 amino acids are selected from glycine, 
alanine, proline, asparagine, glutamine, and lysine. Even more preferably, 
a linker is made up of a majority of amino acids that are sterically 
unhindered, such as glycine and alanine. Thus, preferred linkers are 
polyglycines (particularly (Gly) 4 , (Gly) 5 ), poly(Gly-Ala), and polyalanines. 
15 Other specific examples of linkers are: 

(Gly) 3 Lys(Gly) 4 (SEQ ID NO: 333); 
(Gly) 3 AsnGlySer(Gly) 2 (SEQ ID NO: 334); 
(Gly) 3 Cys(Gly) 4 (SEQ ID NO: 335); and 
GlyProAsnGlyGly (SEQ ID NO: 336). 
20 To explain the above nomenclature, for example, (Gly) 3 Lys(Gly) 4 means 
Gly-Gly-Gly-Lys-Gly-Gly-Gly-Gly. Combinations of Gly and Ala are also 
preferred. The linkers shown here are exemplary; linkers within the scope 
of this invention may be much longer and may include other residues. 
Non-peptide linkers are also possible. For example, alkyl linkers 
2 5 such as -NH-(CH 2 ) s -C(0)-, wherein s = 2-20 could be used. These alkyl 
linkers may further be substituted by any non-sterically hindering group 
such as lower alkyl (e.g., C r C 6 ) lower acyl, halogen (e.g., CI, Br), CN, NH 2 , 
phenyl, etc. An exemplary non-peptide linker is a PEG linker, 
VI 



-68- 



WO 01/83525 



PCT/US01/14310 




wherein n is such that the linker has a molecular weight of 100 to 5000 kD, 
preferably 100 to 500 kD. The peptide linkers may be altered to form 
5 derivatives in the same manner as described above. 

Derivatives . The inventors also contemplate derivatizing the 
peptide and/ or vehicle portion of the compounds. Such derivatives may 
improve the solubility, absorption, biological half life, and the like of the 
compounds. The moieties may alternatively eliminate or attenuate any 
1 0 undesirable side-effect of the compounds and the like. Exemplary 
derivatives include compounds in which: 

1. The compound or some portion thereof is cyclic. For example, the 
peptide portion may be modified to contain two or more Cys residues 
(e.g., in the linker), which could cyclize by disulfide bond formation. 

15 For citations to references on preparation of cyclized derivatives, see 

Table 2. 

2. The compound is cross-linked or is rendered capable of cross-linking 
between molecules. For example, the peptide portion may be modified 
to contain one Cys residue and thereby be able to form an 

2 0 intermolecular disulfide bond with a like molecule. The compound 

may also be cross-linked through its C-terminus, as in the molecule 
shown below. 
VII 

O 




F 1 -(X 1 ) b -C(>N 

F 1 -(X 1 ) b -CO-N^Y 

O 



NH 
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4 . One or more peptidyl [-C(0)NR-] linkages (bonds) is replaced by a 
non-peptidyl linkage. Exemplary non-peptidyl linkages are -CH 2 - 
carbamate [-CH 2 -OC(0)NR-], phosphonate , -CH 2 -sulfonamide [-CH 2 - 
S(0) 2 NR-], urea [-NHC(0)NH-], -CH 2 -secondary amine, and alkylated 
5 peptide [-C(0)NR 6 - wherein R 6 is lower alkyl]. 

5. The N-terrninus is derivatized. Typically, the N-terminus may be 
acylated or modified to a substituted amine. Exemplary N-terminal 
derivative groups include -NRR 1 (other than -NH 2 ), -NRC(0)R 1 / 
-NRC(0)OR 1 / -NRS^R 1 , -NHC^NHR 1 , succinimide, or 

1 0 benzyloxycarbonyl-NH- (CBZ-NH-), wherein R and R 1 are each 

independently hydrogen or lower alkyl and wherein the phenyl ring 
may be substituted with 1 to 3 substituents selected from the group 
consisting of Q-Q alkyl, Q-Q alkoxy, cWoro, and bromo. 

6. The free C-terminus is derivatized. Typically, the C-ter minus is 

15 esterified or amidated. For example, one may use methods described in 

the art to add (NH-CH 2 -CH 2 -NH 2 ) 2 to compounds of this invention 
having any of SEQ ID NOS: 504 to 508 at the C-terminus. Likewise, 
one may use methods described in the art to add -NH 2 to compounds 
of this invention having any of SEQ ID NOS: 924 to 955, 963 to 972, 

2 0 1005 to 1013, or 1018 to 1023 at the C-terminus. Exemplary C-terminal 

derivative groups include, for example, -C(0)R 2 wherein R 2 is lower 
alkoxy or -NR 3 R 4 wherein R 3 and R 4 are independently hydrogen or Q- 
C 8 alkyl (preferably C r C 4 alkyl). 

7. A disulfide bond is replaced with another, preferably more stable, 
2 5 cross-linking moiety (e.g., an alkylene). See, e.g., Bhatnagar et aL 

(1996), T. Med. Chem . 39: 3814-9; Alberts etal. (1993) Thirteenth Am. 
Pep. Symp ., 357-9. 
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8. One or more individual amino acid residues is modified. Various 
derivatizing agents are known to react specifically with selected 
sidechains or terminal residues, as described in detail below. 

Lysinyl residues and amino terminal residues may be reacted with 
5 succinic or other carboxylic acid anhydrides, which reverse the charge of the 
lysinyl residues. Other suitable reagents for derivatizing alpha-amino- 
containing residues include imidoesters such as methyl picolinimidate; 
pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic 
acid; O-methylisourea; 2,4 pentanedione; and transaminase-catalyzed reaction 
1 0 with gly oxylate . 

Arginyl residues may be modified by reaction with any one or 
combination of several conventional reagents, including phenylglyoxal, 2,3- 
butanedione, 1,2-cyclohexanedione, and ninhydrin. Derivatization of arginyl 
residues requires that the reaction be performed in alkaline conditions because 
15 of the high pKa of the guanidine functional group. Furthermore, these reagents 
may react with the groups of lysine as well as the arginine epsilon-amino 
group. 

Specific modification of tyrosyl residues has been studied extensively, 

with particular interest in introducing spectral labels into tyrosyl residues by 
2 0 reaction with aromatic diazonium compounds or tetranitromethane. Most 

commonly, N-acetylimidizole and tetranitromethane are used to form O-acetyl 

tyrosyl species and 3-nitro derivatives, respectively. 

Carboxyl sidechain groups (aspartyl or glutamyl) may be selectively 

modified by reaction with carbodiimides (R -N=C=N-R / ) such as 1-cyclohexyl- 
2 5 3-(2-morpholinyl~(4-ethyl) carbodiimide or l-ethyl-3-(4-azonia-4,4- 

dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues 

may be converted to asparaginyl and glutaminyl residues by reaction with 

ammonium ions. 
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Glutaminyl and asparaginyl residues may be deamidated to the 
corresponding glutamyl and aspartyl residues. Alternatively, these residues 
are deamidated under mildly acidic conditions. Either form of these residues 
falls within the scope of this invention. 
5 Cysteinyl residues can be replaced by amino acid residues or other 

moieties either to eliminate disulfide bonding or, conversely, to stabilize cross- 
linking. See, e.g., Bhatnagar etal. (1996), T. Med. Chem . 39: 3814-9. 

Derivatization with bifunctional agents is useful for cross-linking the 
peptides or their functional derivatives to a water-insoluble support matrix or 

10 to other macromolecular vehicles. Commonly used cross-linking agents 
include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, 
homobifunctional imidoesters, including disuccinimidyl esters such as 3,3 - 
dithiobis(succinimidylpropionate), and bifunctional maleimides such as bis-N- 

15 maleimido-l,8-octane. Derivatizing agents such as methyl-3-[(p- 

azidophenyl)dithio]propioimidate yield photoactivatable intermediates that are 
capable of forming crosslinks in the presence of light. Alternatively, reactive 
water-insoluble matrices such as cyanogen bromide-activated carbohydrates 
and the reactive substrates described in U.S. Pat. Nos. 3,969,287; 3,691,016; 

2 0 4,195,128; 4,247,642; 4,229,537; and 4,330,440 are employed for protein 
immobilization. 

Carbohydrate (oligosaccharide) groups may conveniently be 
attached to sites that are known to be glycosylation sites in proteins. 
Generally, O-linked oligosaccharides are attached to serine (Ser) or 

2 5 threonine (Thr) residues while N-linked oligosaccharides are attached to 
asparagine (Asn) residues when they are part of the sequence Asn-X- 
Ser/Thr, where X can be any amino acid except proline. X is preferably 
one of the 19 naturally occurring amino acids other than proline. The 
structures of N-linked and O-linked oligosaccharides and the sugar 
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residues found in each type are different. One type of sugar that is 
commonly found on both is N~acetylneuraminic acid (referred to as sialic 
acid). Sialic acid is usually the terminal residue of both N-linked and O- 
linked oligosaccharides and, by virtue of its negative charge, may confer 
5 acidic properties to the glycosylated compound. Such site(s) may be 
incorporated in the linker of the compounds of this invention and are 
preferably glycosylated by a cell during recombinant production of the 
polypeptide compounds (e.g., in mammalian cells such as CHO, BHK, 
COS). However, such sites may further be glycosylated by synthetic or 

1 0 semi-synthetic procedures known in the art. 

Other possible modifications include hydroxylation of proline and 
lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, 
oxidation of the sulfur atom in Cys, methylation of the alpha-amino 
groups of lysine, arginine, and histidine side chains. Creighton, Proteins: 

1 5 Structure and Molecule Properties (W. H. Freeman & Co., San Francisco), 
"pp. 79-86 (1983). 

Compounds of the present invention may be changed at the DNA 
level, as well. The DNA sequence of any portion of the compound may be 
changed to codons more compatible with the chosen host cell. For E. coli, 

2 0 which is the preferred host cell, optimized codons are known in the art. 

Codons may be substituted to eliminate restriction sites or to include silent 
restriction sites, which may aid in processing of the DNA in the selected 
host cell. The vehicle, linker and peptide DNA sequences may be modified 
to include any of the foregoing sequence changes. 

2 5 Isotope- and toxin-conjugated derivatives . Another set of useful 

derivatives are the above-described molecules conjugated to toxins, 
tracers, or radioisotopes. Such conjugation is especially useful for 
molecules comprising peptide sequences that bind to tumor cells or 
pathogens. Such molecules may be used as therapeutic agents or as an aid 
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to surgery (e.g., radioimmunoguided surgery or RIGS) or as diagnostic 
agents (e.g., radioimmunodiagnostics or RID). 

As therapeutic agents, these conjugated derivatives possess a 
number of advantages. They facilitate use of toxins and radioisotopes that 
5 would be toxic if administered without the specific binding provided by 
the peptide sequence. They also can reduce the side-effects that attend the 
use of radiation and chemotherapy by facilitating lower effective doses of 
the conjugation partner. 

Useful conjugation partners include: 
10 • radioisotopes, such as 90 Yttrium, 131 Iodine, ^Actinium, and 

213 Bismuth; 

• ricin A toxin, microbially derived toxins such as Pseudomonas 
endotoxin (e.g., PE38, PE40), and the like; 

• partner molecules in capture systems (see below); 

15 • biotin, streptavidin (useful as either partner molecules in 

capture systems or as tracers, especially for diagnostic use); and 

• cytotoxic agents (e.g., doxorubicin). 

One useful adaptation of these conjugated derivatives is use in a 
capture system. In such a system, the molecule of the present invention 

2 0 would comprise a benign capture molecule. This capture molecule would 
be able to specifically bind to a separate effector molecule comprising, for 
example, a toxin or radioisotope. Both the vehicle-conjugated molecule 
and the effector molecule would be administered to the patient. In such a 
system, the effector molecule would have a short half-life except when 

2 5 bound to the vehicle-conjugated capture molecule, thus minimizing any 

toxic side-effects. The vehicle-conjugated molecule would have a relatively 
long half-life but would be benign and non-toxic. The specific binding 
portions of both molecules can be part of a known specific binding pair 
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(e.g., biotin, streptavidin) or can result from peptide generation methods 
such as those described herein. 

Such conjugated derivatives may be prepared by methods known 
in the art. In the case of protein effector molecules (e.g., Pseudomonas 
5 endotoxin), such molecules can be expressed as fusion proteins from 

correlative DNA constructs. Radioisotope conjugated derivatives may be 
prepared, for example, as described for the BEX A antibody (Coulter). 
Derivatives comprising cytotoxic agents or microbial toxins may be 
prepared, for example, as described for the BR96 antibody (Bristol-Myers 
1 0 Squibb). Molecules employed in capture systems may be prepared, for 

example, as described by the patents, patent applications, and publications 
from NeoRx. Molecules employed for RIGS and RID may be prepared, for 
example, by the patents, patent applications, and publications from 
NeoProbe. 

15 A process for preparing conjugation derivatives is also 

contemplated. Tumor cells, for example, exhibit epitopes not found on 
their normal counterparts. Such epitopes include, for example, different 
post-translational modifications resulting from their rapid proliferation. 
Thus, one aspect of this invention is a process comprising: 
2 0 a) selecting at least one randomized peptide that specifically 

binds to a target epitope; and 
b) preparing a pharmacologic agent comprising (i) at least one 
vehicle (Fc domain preferred), (ii) at least one amino acid 
sequence of the selected peptide or peptides, and (iii) an 
25 effector molecule. 

The target epitope is preferably a tumor-specific epitope or an epitope 
specific to a pathogenic organism. The effector molecule may be any of the 
above-noted conjugation partners and is preferably a radioisotope. 
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Methods of Making 

The compounds of this invention largely may be made in 
transformed host cells using recombinant DNA techniques. To do so, a 
recombinant DNA molecule coding for the peptide is prepared. Methods 
5 of preparing such DNA molecules are well known in the art. For instance, 
sequences coding for the peptides could be excised from DNA using 
suitable restriction enzymes. Alternatively, the DNA molecule could be 
synthesized using chemical synthesis techniques, such as the 
phosphoramidate method. Also, a combination of these techniques could 
10 be used. 

The invention also includes a vector capable of expressing the 
peptides in an appropriate host. The vector comprises the DNA molecule 
that codes for the peptides operatively linked to appropriate expression 
control sequences. Methods of effecting this operative linking, either 

15 before or after the DNA molecule is inserted into the vector, are well 
known. Expression control sequences include promoters, activators, 
enhancers, operators, ribosomal binding sites, start signals, stop signals, 
cap signals, polyadenylation signals, and other signals involved with the 
control of transcription or translation. 

2 0 The resulting vector having the DNA molecule thereon is used to 

transform an appropriate host. This transformation may be performed 
using methods well known in the art. 

Any of a large number of available and well-known host cells may 
be used in the practice of this invention. The selection of a particular host 

25 is dependent upon a number of factors recognized by the art. These 
include, for example, compatibility with the chosen expression vector, 
toxicity of the peptides encoded by the DNA molecule, rate of 
transformation, ease of recovery of the peptides, expression characteristics, 
bio-safety and costs. A balance of these factors must be struck with the 
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understanding that not all hosts may be equally effective for the 
expression of a particular DNA sequence. Within these general guidelines, 
useful microbial hosts include bacteria (such as E. coli sp.)/ yeast (such as 
Saccharomyces sp.) and other fungi, insects, plants, mammalian (including 
5 human) cells in culture, or other hosts known in the art. 

Next, the transformed host is cultured and purified. Host cells may 
be cultured under conventional fermentation conditions so that the 
desired compounds are expressed. Such fermentation conditions are well 
known in the art. Finally, the peptides are purified from culture by 

1 0 methods well known in the art. 

The compounds may also be made by synthetic methods. For 
example, solid phase synthesis techniques may be used. Suitable 
techniques are well known in the art, and include those described in 
Merrifield (1973), Chem. Polypeptides, pp. 335-61 (Katsoyannis and 

15 Panayotis eds.); Merrifield (1963), T. Am. Chem. Soc . 85: 2149; Davis etal. 
(1985), Biochem. Intl . 10: 394-414; Stewart and Young (1969), Solid Phase 
Peptide Synthesis; U.S. Pat. No. 3,941,763; Finn etal. (1976), The Proteins 
(3rd ed.) 2: 105-253; and Erickson etal. (1976), The Proteins (3rd ed.) 2: 
257-527. Solid phase synthesis is the preferred technique of making 

2 0 individual peptides since it is the most cost-effective method of making 
small peptides. 

Compounds that contain derivatized peptides or which contain 
non-peptide groups may be synthesized by well-known organic chemistry 
techniques. 
2 5 Uses of the Compounds 

In general . The compounds of this invention have pharmacologic 
activity resulting from their ability to bind to proteins of interest as 
agonists, mimetics or antagonists of the native ligands of such proteins of 
interest. The utility of specific compounds is shown in Table 2. The activity 
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of these compounds can be measured by assays known in the art. For the 
TPO-mimetic and EPO-mimetic compounds, in vivo assays are further 
described in the Examples section herein. 

In addition to therapeutic uses, the compounds of the present 
5 invention are useful in diagnosing diseases characterized by dysfunction 
of their associated protein of interest. In one embodiment, a method of 
detecting in a biological sample a protein of interest (e.g., a receptor) that 
is capable of being activated comprising the steps of: (a) contacting the 
sample with a compound of this invention; and (b) detecting activation of 

1 0 the protein of interest by the compound. The biological samples include 
tissue specimens, intact cells, or extracts thereof. The compounds of this 
invention may be used as part of a diagnostic kit to detect the presence of 
their associated proteins of interest in a biological sample. Such kits 
employ the compounds of the invention having an attached label to allow 

15 for detection. The compounds are useful for identifying normal or 
abnormal proteins of interest. For the EPO-mimetic compounds, for 
example, presence of abnormal protein of interest in a biological sample 
may be indicative of such disorders as Diamond Blackf an anemia, where it 
is believed that the EPO receptor is dysfunctional. 

2 0 Therapeutic uses of EPO-mimetic compounds . The EPO-mimetic 

compounds of the invention are useful for treating disorders characterized 
by low red blood cell levels. Included in the invention are methods of 
modulating the endogenous activity of an EPO receptor in a mammal, 
preferably methods of increasing the activity of an EPO receptor. In 

2 5 general, any condition treatable by erythropoietin, such as anemia, may 
also be treated by the EPO-mimetic compounds of the invention. These 
compounds are administered by an amount and route of delivery that is 
appropriate for the nature and severity of the condition being treated and 
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may be ascertained by one skilled in the art. Preferably, administration is 
by injection, either subcutaneous, intramuscular, or intravenous. 

Therapeutic uses of TPO-mimetic compounds . For the TPO- 
mimetic compounds, one can utilize such standard assays as those 
5 described in W095/26746 entitled "Compositions and Methods for 

Stimulating Megakaryocyte Growth and Differentiation". In vivo assays 
also appear in the Examples hereinafter. 

The conditions to be treated are generally those that involve an 
existing megakaryocyte/platelet deficiency or an expected 

1 0 megakaryocyte/platelet deficiency (e.g., because of planned surgery or 
platelet donation). Such conditions will usually be the result of a 
deficiency (temporary or permanent) of active Mpl ligand in vivo . The 
generic term for platelet deficiency is thrombocytopenia, and hence the 
methods and compositions of the present invention are generally available 

15 for treating thrombocytopenia in patients in need thereof. 

Thrombocytopenia (platelet deficiencies) may be present for 
various reasons, including chemotherapy and other therapy with a variety 
of drugs, radiation therapy, surgery, accidental blood loss, and other 
specific disease conditions. Exemplary specific disease conditions that 

2 0 involve thrombocytopenia and may be treated in accordance with this 
invention are: aplastic anemia, idiopathic thrombocytopenia, metastatic 
tumors which result in thrombocytopenia, systemic lupus erythematosus, 
splenomegaly, Fanconi's syndrome, vitamin B12 deficiency, folic acid 
deficiency, May-Hegglin anomaly, Wiskott-Aldrich syndrome, and 

2 5 paroxysmal nocturnal hemoglobinuria. Also, certain treatments for AIDS 
result in thrombocytopenia (e.g., AZT). Certain wound healing disorders 
might also benefit from an increase in platelet numbers. 

With regard to anticipated platelet deficiencies, e.g., due to future 
surgery, a compound of the present invention could be administered 
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several days to several hours prior to the need for platelets. With regard 
to acute situations, e.g., accidental and massive blood loss, a compound of 
this invention could be administered along with blood or purified 
platelets. 

5 The TPO-mimetic compounds of this invention may also be useful in 

stimulating certain cell types other than megakaryocytes if such cells are found 
to express Mpl receptor. Conditions associated with such cells that express the 
Mpl receptor, which are responsive to stimulation by the Mpl ligand, are also 
within the scope of this invention. 

10 The TPO-mimetic compounds of this invention may be used in any 

situation in which production of platelets or platelet precursor cells is desired, 
or in which stimulation of the c-Mpl receptor is desired. Thus, for example, the 
compounds of this invention may be used to treat any condition in a mammal 
wherein there is a need of platelets, megakaryocytes, and the like. Such 

15 conditions are described in detail in the following exemplary sources: 

W095/26746; W095/21919; W095/18858; WO95/21920 and are incorporated 
herein. 

The TPO-mimetic compounds of this invention may also be useful in 
maintaining the viability or storage life of platelets and/ or megakaryocytes and 

2 0 related cells. Accordingly, it could be useful to include an effective amount of 
one or more such compounds in a composition containing such cells. 

The therapeutic methods, compositions and compounds of the 
present invention may also be employed, alone or in combination with 
other cytokines, soluble Mpl receptor, hematopoietic factors, interleukins, 

2 5 growth factors or antibodies in the treatment of disease states 

characterized by other symptoms as well as platelet deficiencies. It is 
anticipated that the inventive compound will prove useful in treating 
some forms of thrombocytopenia in combination with general stimulators 
of hematopoiesis, such as IL-3 or GM-CSR Other megakaryocyte 
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stimulatory factors, i.e., meg-CSF, stem cell factor (SCF), leukemia 
inhibitory factor (LIF), oncostatin M (OSM), or other molecules with 
megakaryocyte stimulating activity may also be employed with Mpl 
ligand. Additional exemplary cytokines or hematopoietic factors for such 
5 co-administration include IL-1 alpha, IL-1 beta, IL-2, IL-3, IL-4, IL-5, IL-6, 
IL-11, colony stimulating factor- 1 (CSF-1), SCF, GM-CSF, granulocyte 
colony stimulating factor (G-CSF), EPO, interferon-alpha (IFN-alpha), 
consensus interferon, IFN-beta, or IFN-gamma. It may further be useful to 
administer, either simultaneously or sequentially, an effective amount of a 

10 soluble mammalian Mpl receptor, which appears to have an effect of 
causing megakaryocytes to fragment into platelets once the 
megakaryocytes have reached mature form. Thus, administration of an 
inventive compound (to enhance the number of mature megakaryocytes) 
followed by administration of the soluble Mpl receptor (to inactivate the 

15 ligand and allow the mature megakaryocytes to produce platelets) is 
expected to be a particularly effective means of stimulating platelet 
production. The dosage recited above would be adjusted to compensate 
for such additional components in the therapeutic composition. Progress 
of the treated patient can be monitored by conventional methods. 

2 0 In cases where the inventive compounds are added to compositions 

of platelets and/ or megakaryocytes and related cells, the amount to be 
included will generally be ascertained experimentally by techniques and 
assays known in the art. An exemplary range of amounts is 0.1 jxg — 1 mg 
inventive compound per 10 6 cells. 

2 5 Pharmaceutical Compositions 

In General . The present invention also provides methods of using 
pharmaceutical compositions of the inventive compounds. Such 
pharmaceutical compositions may be for administration for injection, or for 
oral, pulmonary, nasal, transdermal or other forms of administration. In 
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general, the invention encompasses pharmaceutical compositions comprising 
effective amounts of a compound of the invention together with 
pharmaceutically acceptable diluents, preservatives, solubilizers, emulsifiers, 
adjuvants and /or carriers. Such compositions include diluents of various 
5 buffer content {e.g., Tris-HCl, acetate, phosphate), pH and ionic strength; 
additives such as detergents and solubilizing agents {e.g., Tween 80, 
Polysorbate 80), anti-oxidants {e.g., ascorbic acid, sodium metabisulfite), 
preservatives {e.g., Thimersol, benzyl alcohol) and bulking substances {e.g., 
lactose, mannitol); incorporation of the material into particulate preparations of 

1 0 polymeric compounds such as polylactic acid, polyglycolic acid, etc. or into 
liposomes. Hyaluronic acid may also be used, and this may have the effect of 
promoting sustained duration in the circulation. Such compositions may 
influence the physical state, stability, rate of in vivo release, and rate of in vivo 
clearance of the present proteins and derivatives. See, e.g., Remington's 

15 Pharmaceutical Sciences, 18th Ed. (1990, Mack Publishing Co., Easton, PA 
18042) pages 1435-1712 which are herein incorporated by reference. The 
compositions may be prepared in liquid form, or may be in dried powder, such 
as lyophilized form. Implantable sustained release formulations are also 
contemplated, as are transdermal formulations. 

2 0 Oral dosage forms . Contemplated for use herein are oral solid 

dosage forms, which are described generally in Chapter 89 of Remington's 
Pharmaceutical Sciences (1990), 18th Ed., Mack Publishing Co. Easton PA 
18042, which is herein incorporated by reference. Solid dosage forms 
include tablets, capsules, pills, troches or lozenges, cachets or pellets. Also, 

2 5 liposomal or proteinoid encapsulation may be used to formulate the 

present compositions (as, for example, proteinoid microspheres reported 
in U.S. Patent No. 4,925,673). Liposomal encapsulation may be used and 
the liposomes may be derivatized with various polymers (e.g., U.S. Patent 
No. 5,013,556). A description of possible solid dosage forms for the 
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therapeutic is given in Chapter 10 of Marshall, K v Modern Pharmaceutics 
(1979), edited by G. S. Banker and C. T. Rhodes, herein incorporated by 
reference. In general, the formulation will include the inventive 
compound, and inert ingredients which allow for protection against the 
5 stomach environment, and release of the biologically active material in the 
intestine. 

Also specifically contemplated are oral dosage forms of the above 
inventive compounds. If necessary, the compounds may be chemically 
modified so that oral delivery is efficacious. Generally, the chemical 

1 0 modification contemplated is the attachment of at least one moiety to the 
compound molecule itself, where said moiety permits (a) inhibition of 
proteolysis; and (b) uptake into the blood stream from the stomach or 
intestine. Also desired is the increase in overall stability of the compound 
and increase in circulation time in the body. Moieties useful as covalently 

15 attached vehicles in this invention may also be used for this purpose. 
Examples of such moieties include: PEG, copolymers of ethylene glycol 
and propylene glycol, carboxymethyl cellulose, dextran, polyvinyl alcohol, 
polyvinyl pyrrolidone and polyproline. See, for example, Abuchowski and 
Davis, Soluble Polymer-Enzyme Adducts, Enzymes as Drugs (1981), 

2 0 Hocenberg and Roberts, eds., Wiley-Interscience, New York, NY, , pp 367- 
83; Newmark, etal. (1982), T. Appl. Biochem . 4:185-9. Other polymers that 
could be used are poly-l,3-dioxolane and poly-l,3,6-tioxocane. Preferred 
for pharmaceutical usage, as indicated above, are PEG moieties. 

For oral delivery dosage forms, it is also possible to use a salt of a 

2 5 modified aliphatic amino acid, such as sodium N-(8-[2-hydroxybenzoyl] 
amino) caprylate (SNAC), as a carrier to enhance absorption of the 
therapeutic compounds of this invention. The clinical efficacy of a heparin 
formulation using SNAC has been demonstrated in a Phase II trial 
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conducted by Emisphere Technologies. See US Patent No. 5,792,451, "Oral 
drug delivery composition and methods". 

The compounds of this invention can be included in the 
formulation as fine multiparticulates in the form of granules or pellets of 
5 particle size about 1 mm. The formulation of the material for capsule 
administration could also be as a powder, lightly compressed plugs or 
even as tablets. The therapeutic could be prepared by compression. 

Colorants and flavoring agents may all be included. For example, 
the protein (or derivative) may be formulated (such as by liposome or 

1 0 microsphere encapsulation) and then further contained within an edible 
product, such as a refrigerated beverage containing colorants and 
flavoring agents. 

One may dilute or increase the volume of the compound of the 
invention with an inert material. These diluents could include 

15 carbohydrates, especially mannitol, a-lactose, anhydrous lactose, cellulose, 
sucrose, modified dextrans and starch. Certain inorganic salts may also be 
used as fillers including calcium triphosphate, magnesium carbonate and 
sodium chloride. Some commercially available diluents are Fast-Flo, 
Emdex, STA-Rx 1500, Emcompress and Avicell. 

2 0 Disintegrants may be included in the formulation of the therapeutic 

into a solid dosage form. Materials used as disintegrants include but are 
not limited to starch including the commercial disintegrant based on 
starch, Explotab. Sodium starch glycolate, Amber lite, sodium 
carboxymethylcellulose, ultramylopectin, sodium alginate, gelatin, orange 

2 5 peel, acid carboxymethyl cellulose, natural sponge and bentonite may all 
be used. Another form of the disintegrants are the insoluble cationic 
exchange resins. Powdered gums may be used as disintegrants and as 
binders and these can include powdered gums such as agar, Karaya or 
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tragacanth. Alginic acid and its sodium salt are also useful as 
disintegrants. 

Binders may be used to hold the therapeutic agent together to form 
a hard tablet and include materials from natural products such as acacia, 
5 tragacanth, starch and gelatin. Others include methyl cellulose (MC), ethyl 
cellulose (EC) and carboxymethyl cellulose (CMC). Polyvinyl pyrrolidone 
(PVP) and hydroxypropylmethyl cellulose (HPMC) could both be used in 
alcoholic solutions to granulate the therapeutic. 

An antifrictional agent may be included in the formulation of the 

1 0 therapeutic to prevent sticking during the formulation process. Lubricants 
may be used as a layer between the therapeutic and the die wall, and these 
can include but are not limited to; stearic acid including its magnesium 
and calcium salts, polytetrafluoroethylene (PTFE), liquid paraffin, 
vegetable oils and waxes. Soluble lubricants may also be used such as 

15 sodium lauryl sulfate, magnesium lauryl sulfate, polyethylene glycol of 
various molecular weights, Carbowax 4000 and 6000. 

Glidants that might improve the flow properties of the drug during 
formulation and to aid rearrangement during compression might be 
added. The glidants may include starch, talc, pyro genie silica and 

2 0 hydrated silicoaluminate. 

To aid dissolution of the compound of this invention into the 
aqueous environment a surfactant might be added as a wetting agent. 
Surfactants may include anionic detergents such as sodium lauryl sulfate, 
dioctyl sodium sulfosuccinate and dioctyl sodium sulfonate. Cationic 

2 5 detergents might be used and could include benzalkonium chloride or 
benzethonium chloride. The list of potential nonionic detergents that 
could be included in the formulation as surfactants are lauromacrogol 400, 
polyoxyl 40 stearate, polyoxyethylene hydrogenated castor oil 10, 50 and 
60, glycerol monostearate, polysorbate 40, 60, 65 and 80, sucrose fatty acid 
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ester, methyl cellulose and carboxymethyl cellulose. These surfactants 
could be present in the formulation of the protein or derivative either 
alone or as a mixture in different ratios. 

Additives may also be included in the formulation to enhance 
5 uptake of the compound. Additives potentially having this property are 
for instance the fatty acids oleic acid, linoleic acid and linolenic acid. 

Controlled release formulation may be desirable. The compound of 
this invention could be incorporated into an inert matrix which permits 
release by either diffusion or leaching mechanisms e.g., gums. Slowly 

1 0 degenerating matrices may also be incorporated into the formulation, e.g., 
alginates, polysaccharides. Another form of a controlled release of the 
compounds of this invention is by a method based on the Oros therapeutic 
system (Alza Corp.), i.e., the drug is enclosed in a semipermeable 
membrane which allows water to enter and push drug out through a 

15 single small opening due to osmotic effects. Some enteric coatings also 
have a delayed release effect. 

Other coatings may be used for the formulation. These include a 
variety of sugars which could be applied in a coating pan. The therapeutic 
agent could also be given in a film coated tablet and the materials used in 

2 0 this instance are divided into 2 groups. The first are the nonenteric 
materials and include methyl cellulose, ethyl cellulose, hydroxyethyl 
cellulose, methylhydroxy-ethyl cellulose, hydroxypropyl cellulose, 
hydroxypropyl-methyl cellulose, sodium carboxy-methyl cellulose, 
providone and the polyethylene glycols. The second group consists of the 

2 5 enteric materials that are commonly esters of phthalic acid. 

A mix of materials might be used to provide the optimum film 
coating. Film coating may be carried out in a pan coater or in a fluidized 
bed or by compression coating. 
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Pulmonary delivery forms . Also contemplated herein is pulmonary 
delivery of the present protein (or derivatives thereof). The protein (or 
derivative) is delivered to the lungs of a mammal while inhaling and 
traverses across the lung epithelial lining to the blood stream. (Other 
5 reports of this include Adjei et al ., Pharma. Res . (1990) 7: 565-9; Adjei et al . 
(1990), Internatl. T> Pharmaceutics 63: 135-44 (leuprolide acetate); Braquet 
et al . (1989), T. Cardiovasc. Pharmacol . 13 (suppl.5): s.143-146 (endothelin- 
1); Hubbard et al . (1989), Annals Int. Med . 3: 206-12 (al-antitrypsin); Smith 
et al . (1989), T. Clin. Invest . 84: 1145-6 (al-proteinase); Oswein etal. (March 
1 0 1990), "Aerosolization of Proteins", Proc. Symp. Resp. Drug Delivery II, 
Keystone, Colorado (recombinant human growth hormone); Debs et al . 
(1988), T. Immunol. 140: 3482-8 (interferon-y and tumor necrosis factor a) 
and Platz et al ., U.S. Patent No. 5,284,656 (granulocyte colony stimulating 
factor). 

15 Contemplated for use in the practice of this invention are a wide 

range of mechanical devices designed for pulmonary delivery of 
therapeutic products, including but not limited to nebulizers, metered 
dose inhalers, and powder inhalers, all of which are familiar to those 
skilled in the art. Some specific examples of commercially available 

2 0 devices suitable for the practice of this invention are the Ultravent 

nebulizer, manufactured by Mallinckrodt, Inc., St. Louis, Missouri; the 
Acorn II nebulizer, manufactured by Marquest Medical Products, 
Englewood, Colorado; the Ventolin metered dose inhaler, manufactured 
by Glaxo Inc., Research Triangle Park, North Carolina; and the Spinhaler 

2 5 powder inhaler, manufactured by Fisons Corp., Bedford, Massachusetts. 

All such devices require the use of formulations suitable for the 
dispensing of the inventive compound. Typically, each formulation is 
specific to the type of device employed and may involve the use of an 
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appropriate propellant material, in addition to diluents, adjuvants 
and /or carriers useful in therapy. 

The inventive compound should most advantageously be 
prepared in particulate form with an average particle size of less than 10 
5 jxm (or microns), most preferably 0.5 to 5 jLim, for most effective delivery 
to the distal lung. 

Pharmaceutically acceptable carriers include carbohydrates such 
as trehalose, mannitol, xylitol, sucrose, lactose, and sorbitol. Other 
ingredients for use in formulations may include DPPC, DOPE, DSPC and 

1 0 DOPC. Natural or synthetic surfactants may be used. PEG may be used 
(even apart from its use in derivatizing the protein or analog). Dextrans, 
such as cyclodextran, may be used. Bile salts and other related enhancers 
may be used. Cellulose and cellulose derivatives may be used. Amino 
acids may be used, such as use in a buffer formulation. 

15 Also, the use of liposomes, microcapsules or microspheres, 

inclusion complexes, or other types of carriers is contemplated. 

Formulations suitable for use with a nebulizer, either jet or 
ultrasonic, will typically comprise the inventive compound dissolved in 
water at a concentration of about 0.1 to 25 mg of biologically active protein 

2 0 per mL of solution. The formulation may also include a buffer and a 
simple sugar (e.g., for protein stabilization and regulation of osmotic 
pressure). The nebulizer formulation may also contain a surfactant, to 
reduce or prevent surface induced aggregation of the protein caused by 
atomization of the solution in forming the aerosol. 

2 5 Formulations for use with a metered-dose inhaler device will 

generally comprise a finely divided powder containing the inventive 
compound suspended in a propellant with the aid of a surfactant. The 
propellant may be any conventional material employed for this purpose, 
such as a chlorofluorocarbon, a hydrochlorofluorocarbon, a 
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hydrofluorocarbon, or a hydrocarbon, including trichlorofluoromethane, 
dichlorodifluoromethane, dichlorotetrafluoroethanol, and 1,1,1,2- 
tetrafluoroethane, or combinations thereof. Suitable surfactants include 
sorbitan trioleate and soya lecithin. Oleic acid may also be useful as a 
5 surfactant. 

Formulations for dispensing from a powder inhaler device will 
comprise a finely divided dry powder containing the inventive compound 
and may also include a bulking agent, such as lactose, sorbitol, sucrose, 
mannitol, trehalose, or xylitol in amounts which facilitate dispersal of the 
1 0 powder from the device, e.g., 50 to 90% by weight of the formulation. 

Nasal delivery forms . Nasal delivery of the inventive compound is 
also contemplated. Nasal delivery allows the passage of the protein to the 
blood stream directly after administering the therapeutic product to the 
nose, without the necessity for deposition of the product in the lung. 
15 Formulations for nasal delivery include those with dextran or 

cyclodextran. Delivery via transport across other mucous membranes is 
also contemplated. 

Buccal delivery forms. Buccal delivery of the inventive compound 
is also contemplated. Buccal delivery formulations are known in the art for 
2 0 use with peptides. 

Dosages . The dosage regimen involved in a method for treating the 
above-described conditions will be determined by the attending physician, 
considering various factors which modify the action of drugs, e.g. the age, 
condition, body weight, sex and diet of the patient, the severity of any infection, 
2 5 time of administration and other clinical factors. Generally, the daily regimen 
should be in the range of 0.1-1000 micrograms of the inventive compound per 
kilogram of body weight, preferably 0.1-150 micrograms per kilogram. 
Specific preferred embodiments 
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The inventors have determined preferred peptide sequences for 
molecules having many different kinds of activity. The inventors have 
further determined preferred structures of these preferred peptides 
combined with preferred linkers and vehicles. Preferred structures for 
5 these preferred peptides listed in Table 21 below. 

Table 21 — Preferred embodiments 



Sequence/structure 


SEQ 
ID 
NO: 


Activity 


F -(G) 5 -IEGPTLRQWLAARA-(G) 8 -IEGPTLRQWLAARA 


337 


TPO-mimetic 


IEGPTLRQWLAARA-(G) R -1EGPTLRQWLAARA-(G) B - F 1 


338 


TPO-mimetic 


F 1 -(G) 5 - 1 EG PTLRQ WLAAR A 


1032 


TPO-mimetic 


I EG PTLRQ WLAAR A -(G) 5 - F 


1033 


TPO-mimetic 


F -(G) 5 -GGTYSCHFGPLTWVCKPQGG-(G) 4 - 
GGTYSCHFGPLTWVCKPQGG 


339 


EPO-mimetic 


1 i uUnrurL 1 VV VUi\rwuu-^uj 4 - 

GGTYSCHFGPLTWVCKPQGG-(G) 5 -F 1 


340 


P PH - m i yt\ of if* 


GGTYSCHFGPLTWVCKPQGG-(G) 5 -F 1 


1034 


EPO-mimetic 


F 1 -(G) S -DFLPHYKNTSLGHRP 


1045 


TNF-a inhibitor 


DFLPH YKNTSLG H RP-(G) 5 -F 1 


1046 


TNF-a inhibitor 


F 1 -(G) 5 " FEWTPGYWQPYALPL 


1047 


IL-1 R antagonist 


FEWTPGYWQPYALPL-(G) S -F 1 


1048 


IL-1 R antagonist 


F'-(G) 5 -VEPNCDIHVMWEWECFERL 


1049 


VEGF-antagonist 


VEPNCDIHVMWEWECFERL-(G) S -F 1 


1050 


VEGF-antagonist 


F 1 -(G) 5 -CTTHWGFTLC 


1051 


MMP inhibitor 


CTTHWGFTLC-(G) 5 -F 1 


1052 


MMP inhibitor 



"F 1 " is an Fc domain as defined previously herein. 



Working examples 

The compounds described above may be prepared as described 
10 below. These examples comprise preferred embodiments of the invention 
and are illustrative rather than limiting. 
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Example 1 
TPO-Mimetics 

The following example uses peptides identified by the numbers 
appearing in Table A hereinafter. 
5 Preparation of peptide 19 . Peptide 17b (12 mg) and MeO-PEG-SH 

5000 (30 mg, 2 equiv.) were dissolved in 1 ml aqueous buffer (pH 8). The 
mixture was incubated at RT for about 30 minutes and the reaction was 
checked by analytical HPLC, which showed a > 80% completion of the 
reaction. The pegylated material was isolated by preparative HPLC. 

10 Preparation of peptide 20 . Peptide 18 (14 mg) and MeO-PEG- 

maleimide (25 mg) were dissolved in about 1.5 ml aqueous buffer (pH 8). 
The mixture was incubated at RT for about 30 minutes, at which time 
about 70% transformation was complete as monitored with analytical 
HPLC by applying an aliquot of sample to the HPLC column. The 

15 pegylated material was purified by preparative HPLC. 

Bioactivity assay . The TPO in vitro bioassay is a mi to genie assay 
utilizing an IL-3 dependent clone of murine 32D cells that have been 
transfected with human mpl receptor. This assay is described in greater 
detail in WO 95/26746. Cells are maintained in MEM medium containing 

2 0 10% Fetal Clone II and 1 ng/ ml mIL-3. Prior to sample addition, cells are 
prepared by rinsing twice with growth medium lacking mIL-3. An 
extended twelve point TPO standard curve is prepared, ranging from 33 
to 39 pg/ml. Four dilutions, estimated to fall within the linear portion of 
the standard curve, (100 to 125 pg/ ml), are prepared for each sample and 

2 5 run in triplicate. A volume of 100 fil of each dilution of sample or 
standard is added to appropriate wells of a 96 well microtiter plate 
containing 10,000 cells/well. After forty-four hours at 37 °C and 10% C0 2 , 
MTS (a tetrazolium compound which is bioreduced by cells to a formazan) 
is added to each well. Approximately six hours later, the optical density is 
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read on a plate reader at 490 nm. A dose response curve (log TPO 
concentration vs. O.D.- Background) is generated and linear regression 
analysis of points which fall in the linear portion of the standard curve is 
performed. Concentrations of unknown test samples are determined 
5 using the resulting linear equation and a correction for the dilution factor. 
TMP tandem repeats with polyglycine linkers . Our design of 
sequentially linked TMP repeats was based on the assumption that a 
dimeric form of TMP was required for its effective interaction with c-Mpl 
(the TPO receptor) and that depending on how they were wound up 

1 0 against each other in the receptor context, the two TMP molecules could 
be tethered together in the C- to N-terminus configuration in a way that 
would not perturb the global dimeric conformation. Clearly, the success 
of the design of tandem linked repeats depends on proper selection of the 
length and composition of the linker that joins the C- and N- termini of the 

15 two sequentially aligned TMP monomers. Since no structural information 
of the TMP bound to c-Mpl was available, a series of repeated peptides 
with linkers composed of 0 to 10 and 14 glycine residues (Table A) were 
synthesized. Glycine was chosen because of its simplicity and flexibility, 
based on the rationale that a flexible polyglycine peptide chain might 

2 0 allow for the free folding of the two tethered TMP repeats into the 

required conformation, while other amino acid sequences may adopt 
undesired secondary structures whose rigidity might disrupt the correct 
packing of the repeated peptide in the receptor context. 

The resulting peptides are readily accessible by conventional solid 

2 5 phase peptide synthesis methods (Merrifield (1963), T. Amer. Chem. Soc . 
85: 2149) with either Fmoc or t~Boc chemistry. Unlike the synthesis of the 
C-terminally linked parallel dimer which required the use of an 
orthogonally protected lysine residue as the initial branch point to build 
the two peptide chains in a pseudosymmetrical way (Cwirla et al . (1997), 
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Science 276: 1696-9), the synthesis of these tandem repeats was a 
straightforward, stepwise assembly of the continuous peptide chains from 
the C- to N-terminus- Since dimerization of TMP had a more dramatic 
effect on the proliferative activity than binding affinity as shown for the C- 
5 terminal dimer (Cwirla etal. (1997)), the synthetic peptides were tested 
directly for biological activity in a TPO-dependent cell-proliferation assay 
using an IL-3 dependent clone of murine 32D cells transfected with the 
full-length c-Mpl (Palacios etal.,. Cell 41:727 (1985)). As the test results 
showed, all the polyglycine linked tandem repeats demonstrated >1000 

1 0 fold increases in potency as compared to the monomer, and were even 

more potent than the C-terminal dimer in this cell proliferation assay. The 
absolute activity of the C-terminal dimer in our assay was lower than that 
of the native TPO protein, which is different from the previously reported 
findings in which the C-terminal dimer was found to be as active as the 

1 5 natural ligand (Cwirla etal. (1997)). This might be due to differences in 
the conditions used in the two assays. Nevertheless, the difference in 
activity between tandem (C terminal of first monomer linked to N 
terminal of second monomer) and C-terminal (C terminal of first monomer 
linked to C terminal of second monomer; also referred to as parallel) 

2 0 dimers in the same assay clearly demonstrated the superiority of tandem 
repeat strategy over parallel peptide dimerization. It is interesting to note 
that a wide range of length is tolerated by the linker. The optimal linker 
between tandem peptides with the selected TMP monomers apparently is 
composed of 8 glycines. 

2 5 Other tandem repeats . Subsequent to this first series of TMP 

tandem repeats, several other molecules were designed either with 
different linkers or containing modifications within the monomer itself. 
The first of these molecules, peptide 13, has a linker composed of GPNG, a 
sequence known to have a high propensity to form a p -turn- type 



-93- 



WO 01/83525 



PCT/US01/14310 



secondary structure. Although still about 100-fold more potent than the 
monomer, this peptide was found to be >10-fold less active than the 
equivalent GGGG-linked analog. Thus, introduction of a relatively rigid 
p-turn at the linker region seemed to have caused a slight distortion of the 
5 optimal agonist conformation in this short linker form. 

The Trp9 in the TMP sequence is a highly conserved residue among 
the active peptides isolated from random peptide libraries. There is also a 
highly conserved Trp in the consensus sequences of EPO mimetic peptides 
and this Trp residue was found to be involved in the formation of a 

1 0 hydrophobic core between the two EMPs and contributed to hydrophobic 
interactions with the EPO receptor. Livnah etal. (1996), Science 273: 464- 
71). By analogy, the Trp9 residue in TMP might have a similar function in 
dimerization of the peptide ligand, and as an attempt to modulate and 
estimate the effects of noncovalent hydrophobic forces exerted by the two 

1 5 indole rings, several analogs were made resulting from mutations at the 
Trp. So in peptide 14, the Trp residue was replaced in each of the two 
TMP monomers with a Cys, and an intramolecular disulfide bond was 
formed between the two cysteines by oxidation which was envisioned to 
mimic the hydrophobic interactions between the two Trp residues in 

2 0 peptide dimerization. Peptide 15 is the reduced form of peptide 14. In 
peptide 16, the two Trp residues were replaced by Ala. As the assay data 
show, all three analogs were inactive. These data further demonstrated 
that Trp is critical for the activity of the TPO mimetic peptide, not just for 
dimer formation. 

2 5 The next two peptides (peptide 17a, and 18) each contain in their 8- 

amino acid linker a Lys or Cys residue. These two compounds are 
precursors to the two PEGylated peptides (peptide 19 and 20) in which the 
side chain of the Lys or Cys is modified by a PEG moiety. A PEG moiety 
was introduced at the middle of a relatively long linker, so that the large 
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PEG component (5 kDa) is far enough away from the critical binding sites 
in the peptide molecule. PEG is a known biocompatible polymer which is 
increasingly used as a covalent modifier to improve the pharmacokinetic 
profiles of peptide- and protein-based therapeutics. 
5 A modular, solution-based method was devised for convenient 

PEGylation of synthetic or recombinant peptides. The method is based on 
the now well established chemoselective ligation strategy which utilizes 
the specific reaction between a pair of mutually reactive functionalities. 
So, for pegylated peptide 19, the lysine side chain was preactivated with a 

10 bromoacetyl group to give peptide 17b to accommodate reaction with a 
thiol-derivatized PEG. To do that, an orthogonal protecting group, Dde, 
was employed for the protection of the lysine £- amine. Once the whole 
peptide chain was assembled, the NT-terminal amine was reprotected with 
t-Boc. Dde was then removed to allow for the bromoacetylation. This 

15 strategy gave a high quality crude peptide which was easily purified using 
conventional reverse phase HPLC. Ligation of the peptide with the thiol- 
modified PEG took place in aqueous buffer at pH 8 and the reaction 
completed within 30 minutes. MALDI-MS analysis of the purified, 
pegylated material revealed a characteristic, bell-shaped spectrum with an 

2 0 increment of 44 Da between the adjacent peaks. For PEG-peptide 20, a 
cysteine residue was placed in the linker region and its side chain thiol 
group would serve as an attachment site for a maleimide-containing PEG. 
Similar conditions were used for the pegylation of this peptide. As the 
assay data revealed, these two pegylated peptides had even higher in vitro 

2 5 bioactivity as compared to their unpegylated counterparts. 

Peptide 21 has in its 8-amino acid linker a potential glycosylation 
motif, NGS. Since our exemplary tandem repeats are made up of natural 
amino acids linked by peptide bonds, expression of such a molecule in an 
appropriate eukaryotic cell system should produce a glycopeptide with 
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the carbohydrate moiety added on the side chain carboxyamide of Asn. 
Glycosylation is a common post-translational modification process which 
can have many positive impacts on the biological activity of a given 
protein by increasing its aqueous solubility and in vivo stability. As the 
5 assay data show, incorporation of this glycosylation motif into the linker 
maintained high bioactivity. The synthetic precursor of the potential 
glycopeptide had in effect an activity comparable to that of the -(G) 8 - 
linked analog. Once glycosylated, this peptide is expected to have the 
same order of activity as the pegylated peptides, because of the similar 

1 0 chemophysical properties exhibited by a PEG and a carbohydrate moiety. 

The last peptide is a dimer of a tandem repeat. It was prepared by 
oxidizing peptide 18, which formed an intermolecular disulfide bond 
between the two cysteine residues located at the linker. This peptide was 
designed to address the possibility that TMP was active as a tetramer. The 

15 assay data showed that this peptide was not more active than an average 
tandem repeat on an adjusted molar basis, which indirectly supports the 
idea that the active form of TMP is indeed a dimer, otherwise dimerization 
of a tandem repeat would have a further impact on the bioactivity. 

In order to confirm the in vitro data in animals, one pegylated TMP 

2 0 tandem repeat (compound 20 in Table A) was delivered subcutaneously to 
normal mice via osmotic pumps. Time and dose-dependent increases 
were seen in platelet numbers for the duration of treatment. Peak platelet 
levels over 4-fold baseline were seen on day 8. A dose of 10 jig/kg/day of 
the pegylated TMP repeat produced a similar response to rHuMGDF 

2 5 (non-pegylated) at 100 jxg/kg/ day delivered by the same route. 
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Table A — TPO-mimetic Peptides 



Peptide 


Compound 


SEQID 


Relative 


No. 




NO: 


Potency 




TPO 




++++ 




TMP monomer 


13 


+ 




TMP C-C dimer 




+++- 


TMP-(G) n -TMP: 






1 


n = 0 


341 


++++- 


2 


n = 1 


342 


++++ 


3 


n = 2 


343 


++++ 


4 


n = 3 


344 


++++ 


5 


n = 4 


345 


++++ 


6 


n = 5 


346 


++++ 


7 


n = 6 


347 


++++ 


8 


n = 7 


348 


++++ 


9 


n = 8 


349 


++++- 


10 


n = 9 


350 


++++ 


11 


n = 10 


351 


++++ 


12 


n = 14 


352 


++++ 


13 


TMP-GPNG-TMP 


353 


+++ 


14 


IEGPTLRQCLAARA-GGGGGGGG-IEGPTLRQCLAARA 


354 


- 


15 


I I 

(cyclic) 

I EG PTLRQCLAARA-G GGGGGGG- 
IEGPTLRQCLAARA (linear) 


355 


- 


16 


IEGPTLRQALAARA-GGGGGGGG- 
I EG PTLRQ ALAAR A 


356 




17a 


TMP-GGGKGGGG-TMP 


nr-7 


++++ 


17b 


TMP-GGGK(BrAc)GGGG-TMP 


358 


ND 


18 


TMP-GGGCGGGG-TMP 


359 


++++ 


19 


TMP-GGGK(PEG)GGGG-TMP 


360 


+++++ 


20 


TMP-GGGC(PEG)GGGG-TMP 


361 


+++++ 


21 


TMP-GGGN*GSGG-TMP 


362 


++++ 


22 


TMP-GGGCGGGG-TMP 

I 

TMP-GGGCGGGG-TMP 


363 
363 


++++ 



-97- 



WO 01/83525 



PCT/US01/14310 



Discussion , It is well accepted that MGDF acts in a way similar to 
hGH, i.e., one molecule of the protein ligand binds two molecules of the 
receptor for its activation. Wells et al ,(1996), Ann. Rev. Biochem . 65: 609- 
34. Now, this interaction is mimicked by the action of a much smaller 
5 peptide, TMP. However, the present studies suggest that this mimicry 
requires the concerted action of two TMP molecules, as covalent 
dimerization of TMP in either a C-C parallel or C»N sequential fashion 
increased the in vitro biological potency of the original monomer by a 
factor of greater than 10 3 . The relatively low biopotency of the monomer is 
1 0 probably due to inefficient formation of the noncovalent dimer. A 

preformed covalent repeat has the ability to eliminate the entropy barrier 
for the formation of a noncovalent dimer which is exclusively driven by 
weak, noncovalent interactions between two molecules of the small, 14- 
residue peptide. 

15 It is intriguing that this tandem repeat approach had a similar effect 

on enhancing bioactivity as the reported C-C dimerization is intriguing. 
These two strategies brought about two very different molecular 
configurations. The C-C dimer is a quasi-symmetrical molecule, while the 
tandem repeats have no such symmetry in their linear structures. Despite 

2 0 this difference in their primary structures, these two types of molecules 
appeared able to fold effectively into a similar biologically active 
conformation and cause the dimerization and activation of c-Mpl. These 
experimental observations provide a number of insights into how the two 
TMP molecules may interact with one another in binding to c-Mpl. First, 

2 5 the two C-termini of the two bound TMP molecules must be in relatively 
close proximity with each other, as suggested by data on the C-terminal 
dimer. Second, the respective N- and C-termini of the two TMP molecules 
in the receptor complex must also be very closely aligned with each other, 
such that they can be directly tethered together with a single peptide bond 
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to realize the near maximum activity-enhancing effect brought about by 
the tandem repeat strategy. Insertion of one or more (up to 14) glycine 
residues at the junction did not increase (or decrease) significantly the 
activity any further. This may be due to the fact that a flexible poly glycine 
5 peptide chain is able to loop out easily from the junction without causing 
any significant changes in the overall conformation. This flexibility seems 
to provide the freedom of orientation for the TMP peptide chains to fold 
into the required conformation in interacting with the receptor and 
validate it as a site of modification. Indirect evidence supporting this 

1 0 came from the study on peptide 13, in which a much more rigid b-turn- 
forming sequence as the linker apparently forced a deviation of the 
backbone alignment around the linker which might have resulted in a 
slight distortion of the optimal conformation, thus resulting in a moderate 
(10-fold) decrease in activity as compared with the analogous compound 

15 with a 4-Gly linker. Third, Trp9 in TMP plays a similar role as Trpl3 in 
EMP, which is involved not only in peptide:peptide interaction for the 
formation of dimers but also is important for contributing hydrophobic 
forces in peptide:receptor interaction. Results obtained with the W to C 
mutant analog, peptide 14, suggest that a covalent disulfide linkage is not 

2 0 sufficient to approximate the hydrophobic interactions provided by the 
Trp pair and that, being a short linkage, it might bring the two TMP 
monomers too close, therefore perturbing the overall conformation of the 
optimal dimeric structure. 

An analysis of the possible secondary structure of the TMP peptide 

2 5 can provide further understanding on the interaction between TMP and c- 
Mpl. This can be facilitated by making reference to the reported structure 
of the EPO mimetic peptide. Livnah etal. (1996), Science 273:464-75 The 
receptor-bound EMP has a b-hairpin structure with a b-turn formed by the 
highly consensus Gly-Pro-Leu-Thr at the center of its sequence. Instead of 
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GPLT, TMP has a highly selected GPTL sequence which is likely to form a 
similar turn. However, this turn-like motif is located near the N-terminal 
part in TMP. Secondary structure prediction using Chau-Fasman method 
suggests that the C-terminal half of the peptide has a tendency to adopt a 
5 helical conformation. Together with the highly conserved Trp at position 
9, this C-terminal helix may contribute to the stabilization of the dimeric 
structure. It is interesting to note that most of our tandem repeats are 
more potent than the C-terminal parallel dimer. Tandem repeats seem to 
give the molecule a better fit conformation than does the C-C parallel 

1 0 dimerization. The seemingly asymmetric feature of a tandem repeat 

might have brought it closer to the natural ligand which, as an asymmetric 
molecule, uses two different sites to bind two identical receptor molecules. 

Introduction of a PEG moiety was envisaged to enhance the in vivo 
activity of the modified peptide by providing it a protection against 

15 proteolytic degradation and by slowing down its clearance through renal 
filtration. It was unexpected that pegylation could further increase the in 
vitro bioactivity of a tandem repeated TMP peptide in the cell-based 
proliferation assay. 

Example 2 

20 Fc-TMP fusions 

TMPs (and EMPs as described in Example 3) were expressed in 
either monomeric or dimeric form as either N-terminal or C-terminal 
fusions to the Fc region of human IgGl. In all cases, the expression 
construct utilized the luxPR promoter promoter in the plasmid expression 

2 5 vector p AMG21 . 

Fc-TMP . A DNA sequence coding for the Fc region of human IgGl 
fused in-frame to a monomer of the TPO-mimetic peptide was constructed 
using standard PCR technology. Templates for PGR reactions were the 
pFc-A3 vector and a synthetic TMP gene. The synthetic gene was 
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10 



constructed from the 3 overlapping oligonucleotides (SEQ ID NOS: 364, 
365, and 366, respectively) shown below: 

1842-97 AAA AAA GGA TCC TCG AGA TTA AGC ACG AGC AGC CAG CCA 

CTG ACG CAG AGT CGG ACC 

1842-98 AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG ACT CTG CGT 

1842-99 CAG TGG CTG GCT GCT CGT GCT TAA TCT CGA GGA TCC TTT 

TTT 

These oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 367 and 368, respectively) shown 
below: 

15 AAAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCT 

1 + + + + + 60 

C C AGGC TGAGAC GC AGTC AC C GAC C GAC GAGC AC GA 
a KGGGGGIEGP TLRQWLAARA 

2 0 TAATCTCGAGGATCCTTTTTT 

61 _ +- +- 81 

AT T AGAGC TCC T AGG AAAAAA 
a * 

2 5 This duplex was amplified in a PCR reaction using 1842-98 and 1842-97 as 

the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers shown below (SEQ ID NOS: 369 and 370): 

3 0 1216-52 AAC ATA AGT ACC TGT AGG ATC G . 

183 0-51 TTCGATACCA CCACCTCCAC CTTTACCCGG AGACAGGGAG AGGCTCTTCTGC 

The oligonucleotides 1830-51 and 1842-98 contain an overlap of 24 

3 5 nucleotides, allowing the two genes to be fused together in the correct 

reading frame by combining the above PCR products in a third reaction 
using the outside primers, 1216-52 and 1842-97. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and BamHI, and then ligated 

4 0 into the vector p AMG21 and transformed into competent E. coli strain 

2596 cells as described for EMP-Fc herein. Clones were screened for the 
ability to produce the recombinant protein product and to possess the 
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gene fusion having the correct nucleotide sequence. A single such clone 
was selected and designated Amgen strain #3728. 

The nucleotide and amino acid sequences (SEQ ID NOS: 5 and 6) of 
the fusion protein are shown in Figure 7. 
5 Fc-TMP-TMP . A DNA sequence coding for the Fc region of human 

IgGl fused in-frame to a dimer of the TPO-mimetic peptide was 
constructed using standard PCR technology. Templates for PGR reactions 
were the pFc~A3 vector and a synthetic TMP-TMP gene. The synthetic 
gene was constructed from the 4 overlapping oligonucleotides (SEQ ID 
1 0 NOS: 371 to 374, respectively) shown below: 

183 0-52 AAA GGT GGA GGT GGT GGT ATC GAA GGT CCG 

ACT CTG CGT CAG TGG CTG GCT GCT CGT GCT 

15 183 0-53 ACC TCC ACC ACC AGC ACG AGC AGC CAG 

CCA CTG ACG CAG AGT CGG ACC 

183 0-54 GGT GGT GGA GGT GGC GGC GGA GGT ATT GAG GGC CCA ACC 

CTT CGC CAA TGG CTT GCA GCA CGC GCA 

20 

1830-55 AAA AAA AGG ATC CTC GAG ATT ATG CGC GTG CTG CAA GCC 

ATT GGC GAA GGG TTG GGC CCT CAA TAC CTC CGC CGC C 

The 4 oligonucleotides were annealed to form the duplex encoding an 

2 5 amino acid sequence (SEQ ID NOS: 375 and 376, respectively) shown 

below: 

AAAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCT 
1 + + + + -+ 60 

3 0 C C AGGC TGAGAC GC AGTC AC C GAC CGACGAGC AC GA 

a KGGGGGI EGP TLRQWLAARA 

GGTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCA 
61 + + + + + + 120 

3 5 CCACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGT 

a GGGGGGGGIEGPTLRQWLAA 
CGCGCA 

121 148 

4 0 GCGCGTATTAGAGCTCCTAGGAAAAAAA 

a R A *- 

This duplex was amplified in a PCR reaction using 1830-52 and 1830-55 as 
4 5 the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 1216-52 and 1830-51 as described above for 
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Fc-TMP. The full length fusion gene was obtained from a third PCR 
reaction using the outside primers 1216-52 and 1830-55. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and BamHI, and then ligated 
5 into the vector p AMG21 and transformed into competent E. coli strain 
2596 cells as described in example 1. Clones were screened for the ability 
to produce the recombinant protein product and to possess the gene 
fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3727. 

1 0 The nucleotide and amino acid sequences (SEQ ID NOS: 7 and 8) of 

the fusion protein are shown in Figure 8. 

TMP-TMP-Fc. A DNA sequence coding for a tandem repeat of the 
TPO-mimetic peptide fused in-frame to the Fc region of human IgGl was 
constructed using standard PCR technology. Templates for PCR reactions 

15 were the EMP-Fc plasmid from strain #3688 (see Example 3) and a 
synthetic gene encoding the TMP dimer. The synthetic gene for the 
tandem repeat was constructed from the 7 overlapping oligonucleotides 
shown below (SEQ ID NOS: 377 to 383, respectively): 



2 0 


1885-52 


TTT 


TTT 


CAT 


ATG 


ATC 


GAA 


GGT 


CCG 


ACT 


CTG 


CGT 


CAG 


TGG 




1885-53 


AGC 


ACG 


AGC 


AGC 


CAG 


CCA 


CTG 


ACG 


CAG 


AGT 


CGG 


ACC 


TTC 






GAT 


CAT 


ATG 






















25 


1885-54 


CTG 


GCT 


GCT 


CGT 


GCT 


GGT 


GGA 


GGC 


GGT 


GGG 


GAC 


AAA 


ACT 






CAC 


ACA 


























1885-55 


CTG 


GCT 


GCT 


CGT 


GCT 


GGC 


GGT 


GGT 


GGC 


GGA 


GGG 


GGT 


GGC 






ATT 


GAG 


GGC 


CCA 




















3 0 
































1885-56 


AAG 


CCA 


TTG 


GCG 


AAG 


GGT 


TGG 


GCC 


CTC 


AAT 


GCC 


ACC 


CCC 






TCC 


GCC 


ACC 


ACC 


GCC 




















1885-57 


ACC 


CTT 


CGC 


CAA 


TGG 


CTT 


GCA 


GCA 


CGC 


GCA 


GGG 


GGA 


GGC 


35 




GGT 


GGG 


GAC 


AAA 


ACT 




















1885-58 


CCC 


ACC 


GCC 


TCC 


ccc 


TGC 


GCG 


TGC 


TGC 











These oligonucleotides were annealed to form the duplex shown encoding 
40 an amino acid sequence shown below (SEQ ID NOS 384 and 385): 
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10 



15 



20 



TTTTTTCATATGATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTGGCGGT 

1 + — + + + — + + 60 

GTATACTAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGACCGCCA 
a MIEGPTLRQWLAARAGG- 

GGTGGCGGAGGGGGTGGCATTGAGGGCCCAACCCTTCGCCAATGGCTGGCTGCTCGTGCT 

61 — + + — + — — + -+ + 120 

CCACCGCCTCCCCCACCGTAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTGCGCGT 
a GGGGGGIEGPTLRQWIiAARA 

GGTGGAGGCGGTGGGGACAAAACTCTGGCTGCTCGTGCTGGTGGAGGCGGTGGGGACAAA 

121 + + + + +- + 180 

CCCCCTCCGCCACCC 

a GGGGGDKTLAARAGGGGGDK 

AC TC AC AC A 
181 189 



a T H T 

This duplex was amplified in a PCR reaction using 1885-52 and 1885-58 as 
the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with DNA from the EMP-Fc fusion strain #3688 (see Example 3) using the 

2 5 primers 1885-54 and 1200-54. The full length fusion gene was obtained 

from a third PCR reaction using the outside primers 1885-52 and 1200-54. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xbal and BamHI, and then ligated 
into the vector pAMG21 and transformed into competent E. coli strain 

3 0 2596 cells as described for Fc-EMP herein. Clones were screened for the 

ability to produce the recombinant protein product and to possess the 
gene fusion having the correct nucleotide sequence. A single such clone 
was selected and designated Amgen strain #3798. 

The nucelotide and amino acid sequences (SEQ ID NOS: 9 and 10) 
35 of the fusion protein are shown in Figure 9. 

TMP-Fc . A DNA sequence coding for a monomer of the TPO- 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
obtained fortuitously in the ligation in TMP-TMP-Fc, presumably due to 
the ability of primer 1885-54 to anneal to 1885-53 as well as to 1885-58. A 

4 0 single clone having the correct nucleotide sequence for the TMP-Fc 

construct was selected and designated Amgen strain #3788. 
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The nucleotide and amino acid sequences (SEQ ID NOS: 11 and 12) 
of the fusion protein are shown in Figure 10. 

Expression in E. coli . Cultures of each of the p AMG2 1 -Fc-f usi on 
constructs in E. coli GM221 were grown at 37 °C in Luria Broth medium 
5 containing 50 mg/ml kanamycin. Induction of gene product expression 
from the luxPR promoter was achieved following the addition of the 
synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to the 
culture media to a final concentration of 20 ng/ml. Cultures were 
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial 

1 0 cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-fusions 
were most likely produced in the insoluble fraction in E. coli . Cell pellets 
were lysed directly by resuspension in Laemmli sample buffer containing 

15 10% b-mercaptoethanol and were analyzed by SDS-PAGE. In each case, an 
intense coomassie-stained band of the appropriate molecular weight was 
observed on an SDS-PAGE gel. 

pAMG21 . The expression plasmid pAMG21 can be derived from 
the Amgen expression vector pCFM1656 (ATCC #69576) which in turn be 

2 0 derived from the Amgen expression vector system described in US Patent 
No. 4,710,473. The pCFM1656 plasmid can be derived from the described 
pCFM836 plasmid (Patent No. 4,710,473) by: 

(a) destroying the two endogenous Ndel restriction sites by end 
filling with T4 polymerase enzyme followed by blunt end 

2 5 ligation; 

(b) replacing the DNA sequence between the unique AatH and Clal 
restriction sites containing the synthetic Pl promoter with a 
similar fragment obtained frompCFM636 (patent No. 4,710,473) 
containing the PL promoter (see SEQ ID NO: 386 below); and 
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(c) substituting the small DNA sequence between the unique Clal 
and Kpnl restriction sites with the oligonucleotide having the 
sequence of SEQ ID NO: 387. 
SEQ ID NO: 386: 

5 AatH 

5 ' CTAATTCCGCTCTCACCTACCAAACAATGCCCCCCTGCAAAAAATAAATTCATAT- 

3 ' TGCAGATTAAGGCGAGAGTGGATGGTTTGTTACGGGGGGACGTTTTTTATTTAAGTATA- 

-AAAAAACATACAGATAACCATCTGCGGTGATAAATTATCTCTGGCGGTGTTGACATAAA- 

1 0 - TT TT T TGTATGTC TAT TGGTAGACGC C AC TATT TAAT AGAGAC C GC C AC AAC TGTATTT - 

- TACC ACTGGCGGTGATACTGAGCAC AT 3 ' 

-ATGGTGACCGCCACTATGACTCGTGTAGC 5 ' 

Clai 

15 

SEQ ID NO: 387: 

5 ' C GATTTGATTC TAGAAGGAGGAATAAC ATATGGTT AAC GC GTTGGAATTC GGT AC 3 ' 
3 ' TAAACTAAGATCTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGC 5 ' 

CM Kpnl 

20 

The expression plasmid pAMG21 can then be derived from pCFM1656 by 
making a series of site-directed base changes by PGR overlapping oligo 
mutagenesis and DNA sequence substitutions. Starting with the Bglll site 
(plasmid bp # 180) immediately 5 7 to the plasmid replication promoter 

2 5 P C ppB and proceeding toward the plasmid replication genes, the base pair 

changes are as shown in Table B below. 
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Table B — Base pair changes resulting in pAMG21 

pAMG21 bp # bp in PCFM1656 bp changed to in pAMG21 



5 


# 204 


T/A 


C/G 




# 428 


A/T 


G/C 




# 509 


G/G 


A/T 




# 617 




insert two G/C 




# 679 


G/C 


T/A 


10 


# 980 


T/A 


C/G 




# 994 


G/C 


A/T 




# 1004 


A/T 


C/G 




# 1007 


C/G 


T/A 




# 1028 


A/T 


T/A 


15 


# 1047 


C/G 


T/A 




#1178 


G/C 


T/A 




# 1466 


G/C 


T/A 




#2028 


G/C 


bp deletion 




#2187 


C/G 


T/A 


20 


#2480 


A/T 


T/A 




# 2499-2502 


AGTG 


GTCA 






TCAC 


CAGT 


25 


#2642 


TCCGAGC 


7 bp deletion 






AGGCTCG 




#3435 


G/C 


A/T 




#3446 


G/C 


A/T 


30 


#3643 


A/T 


T/A 



bp 
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The DNA sequence between the unique Aatll (position #4364 in 
pCFM1656) and Sac II (position #4585 in pCFM1656) restriction sites is 
substituted with the DNA sequence (SEQ ID NO: 23) shown in Figures 
17A and 17B. During the ligation of the sticky ends of this substitution 
5 DNA sequence, the outside Aatll and SacI I sites are destroyed. There are 
unique Aatl l and Sac II sites in the substituted DNA. 

GM221 (Amgen #2596 ). The Amgen host strain #2596 is an E.coli K- 
12 strain derived from Amgen strain #393. It has been modified to contain 
both the temperature sensitive lambda repressor cI857s7 in the early ebg 

1 0 region and the lacI Q repressor in the late ebg region (68 minutes). The 
presence of these two repressor genes allows the use of this host with a 
variety of expression systems, however both of these repressors are 
irrelevant to the expression from luxP R . The untransformed host has no 
antibiotic resistances. 

15 The ribosome binding site of the cI857s7 gene has been modified to 

include an enhanced RBS, It has been inserted into the ebg operon 
between nucleotide position 1170 and 1411 as numbered in Genbank 
accession number M64441Gb_Ba with deletion of the intervening ebg 
sequence. The sequence of the insert is shown below with lower case 

2 0 letters representing the ebg sequences flanking the insert shown below 

(SEQ ID NO: 388): 

ttattttcgtGCGGCCGCACCATTATCACCGCCAGAGGTAAACTAGTCAACACGCACGGTGTTAGATATTTAT 
CC C TTGC GGTGATAGATTGAGC AC ATCGATTTGAT TC T AGAAGGAGGGATAATAT ATGAGC AC AAAAAAGAAA 
CCATTAACACAAGAGCAGCTTGAGGACGCACGTCGCCTTAAAGCAATTTATGAAAAAAAGAAAAATGAACTTG 
25 GCTTATCCCAGGAATCTGTCGCAGACAAGATGGGGATGGGGCAGTCAGGCGTTGGTGCTTTATTTAATGGCAT 
CAATGCATTAAATGCTTATAACGCCGCATTGCTTACAAAAATTCTCAAAGTTAGCGTTGAAGAATTTAGCCCT 
T C AATC GC C AGAGAATC TAG GAGATGTATGAAGC GGTTAGTATGC AGC C GT C AC TTAGAAGTGAGTATGAGTA 
CCCTGTTTTTTCTCATGTTCAGGCAGGGATGTTCTCACCTAAGCTTAGAACCTTTACCAAAGGTGATGCGGAG 
AGATGGGTAAGCACAACCAAAAAAGCCAGTGATTCTGCATTCTGGCTTGAGGTTGAAGGTAATTCCATGACCG 

3 0 CACCAACAGGCTCCAAGCCAAGCTTTCCTGACGGAATGTTAATTCTCGTTGACCCTGAGCAGGCTGTTGAGCC 

AGGTGATTTCTGCATAGCCAGACTTGGGGGTGATGAGTTTACCTTCAAGAAACTGATCAGGGATAGCGGTCAG 
GTGTTTTTACAACCACTAAACCCACAGTACCCAATGATCCCATGCAATGAGAGTTGTTCCGTTGTGGGGAAAG 
TTATCGC TAGTC AGTGGCCTGAAGAGACGTTTGGCTGATAGAC TAGTGGATCCACTAGTg 1 1 1 c tgc cc 

3 5 The construct was delivered to the chromosome using a 

recombinant phage called MMebg-cI857s7enhanced RBS #4 into Ftet/393. 
After recombination and resolution only the chromosomal insert described 
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above remains in the cell. It was renamed F'tet/ GM101. F'tet/GMlOl was 
then modified by the delivery of a lacI Q construct into the ebg operon 
between nucleotide position 2493 and 2937 as numbered in the Genbank 
accession number M64441Gb JBa with the deletion of the intervening ebg 
5 sequence. The sequence of the insert is shown below with the lower case 
letters representing the ebg sequences flanking the insert (SEQ ID NO: 
389) shown below: 

ggcggaaaccGACGTCCATCGAATGGTGCAAAACCTTTCGCGGTATGGCATGATAGCGCCCGGAA.GAGAGTCA 
ATTC AGGGTGGTGAATGTGAAAC C AGTAAC GTTATAC GATGTC GC AGAGT ATGC C GGTGTC T C T TATC AGAC C 

10 GTTTCCCGCGTGGTGAACCAGGCCAGCCACGTTTCTGCGAAAACGCGGGAAAAAGTCGAAGCGGCGATGGCGG 
AGCTGAATTACATTCCCAACCGCGTGGCACAACAACTGGCGGGCAAACAGTCGCTCCTGATTGGCGTTGCCAC 
CTCCAGTCTGGCCCTGCACGCGCCGTCGCAAATTGTCGCGGCGATTAAATCTCGCGCCGATCAACTGGGTGCC 
AGCGTGGTGGTGTCGATGGTAGAACGAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCTTCTCGCGC 
AACGCGTCAGTGGGCTGATCATTAACTATCCGCTGGATGACCAGGATGCCATTGCTGTGGAAGCTGCCTGCAC 

15 TAATGTTCCGGCGTTATTTCTTGATGTCTCTGACCAGACACCCATCAACAGTATTATTTTCTCCCATGAAGAC 
GGTACGCGACTGGGCGTGGAGCATCTGGTCGCATTGGGTCACCAGCAAATCGCGCTGTTAGCGGGCCCATTAA 
GTTCTGTCTCGGCGCGTCTGCGTCTGGCTGGCTGGCATAAATATCTCACTCGCAATCAAATTCAGCCGATAGC 
GGAACGGGAAGGCGACTGGAGTGCCATGTCCGGTTTTCAACAAACCATGCAAATGCTGAATGAGGGCATCGTT 
CCCACTGCGATGCTGGTTGCCAACGATCAGATGGCGCTGGGCGCAATGCGCGCCATTACCGAGTCCGGGCTGC 

2 0 GCGTTGGTGC GGATATC TC GGTAGTGGGATAC GAC GATAC CGAAGAC AGC T C ATGTTATATC C C GC C GTTAAC 

CACCATCAAACAGGATTTTCGCCTGCTGGGGCAAACCAGCGTGGACCGCTTGCTGCAACTCTCTCAGGGCCAG 
GCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCACTGGTGAAAAGAAAAACCACCCTGGCGCCCAATACGCAAA 
CCGCCTCTCCCCGCGCGTTGGCCGATTGATTAATGCAGCTGGCACGACAGGTTTGCCGACTGGAAAGCGGACA 
GTAAGGTACCATAGGATCCaggcacagga 

25 

The construct was delivered to the chromosome using a 
recombinant phage called AGebg-LacIQ#5 into F'tet/GMlOl. After 
recombination and resolution only the chromosomal insert described 
above remains in the cell. It was renamed F'tet/GM221. The F'tet episome 

3 0 was cured from the strain using acridine orange at a concentration of 25 

|Lig/ml in LB. The cured strain was identified as tetracyline sensitive and 
was stored as GM221. 



Expression . Cultures of pAMG21-Fc-TMP-TMP in £. coli GM221 in 
3 5 Luria Broth medium containing 50 |xg/ml kanamycin were incubated at 
37°C prior to induction. Induction of Fc-TMP-TMP gene product 
expression from the luxPR promoter was achieved following the addition 
of the synthetic autoinducer N-(3-oxohexanoyl)-DL-homoserine lactone to 
the culture media to a final concentration of 20 ng/ml and cultures were 
40 incubated at 37°C for a further 3 hours. After 3 hours, the bacterial 
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cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-TMP-TMP 
was most likely produced in the insoluble fraction in E. coli . Cell pellets 
5 were lysed directly by resuspension in Laemmli sample buffer containing 
10% •-mercaptoethanol and were analyzed by SDS-PAGE. An intense 
Coomassie stained band of approximately 30kDa was observed on an 
SDS-PAGE gel. The expected gene product would be 269 amino acids in 
length and have an expected molecular weight of about 29.5 kDa. 

1 0 Fermentation was also carried out under standard batch conditions at the 
10 L scale, resulting in similar expression levels of the Fc-TMP-TMP to 
those obtained at bench scale. 

Purification of Fc-TMP-TMP . Cells are broken in water (1/10) by 
high pressure homogenization (2 passes at 14,000 PSI) and inclusion 

15 bodies are harvested by centrifugation (4200 RPM in J-6B for 1 hour). 

Inclusion bodies are solubilized in 6M guanidine, 50mM Tris, 8mM DTT, 
pH 8.7 for 1 hour at a 1/10 ratio. The solubilized mixture is diluted 20 
times into 2M urea, 50 mM tris, 160mM arginine, 3mM cysteine, pH 8.5. 
The mixture is stirred overnight in the cold and then concentrated about 

2 0 10 fold by ultafiltration. It is then diluted 3 fold with lOmM Tris, 1.5M 
urea, pH 9. The pH of this mixture is then adjusted to pH 5 with acetic 
acid. The precipitate is removed by centrifugation and the supernatant is 
loaded onto a SP-Sepharose Fast Flow column equilibrated in 20mM 
NaAc, 100 mM NaCl, pH 5(10mg/ml protein load, room temperature). 

2 5 The protein is eluted off using a 20 column volume gradient in the same 
buffer ranging from lOOmM NaCl to 500mM NaCl. The pool from the 
column is diluted 3 fold and loaded onto a SP-Sepharose HP column in 20 
mM NaAc, 150 mM NaCl, pH 5(10 mg/ml protein load, room 
temperature). The protein is eluted off using a 20 column volume gradient 
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in the same buffer ranging from 150 mM NaCl to 400 mM NaCl. The peak 

is pooled and filtered. 

Characterization of Fc-TMP activity . The following is a summary of 

in vivo data in mice with various compounds of this invention. 
5 Mice: Normal female BDF1 approximately 10-12 weeks of age. 

Bleed schedule: Ten mice per group treated on day 0, two groups 

started 4 days apart for a total of 20 mice per group. Five mice bled at each 

time point, mice were bled a minimum of three times a week. Mice were 

anesthetized with isoflurane and a total volume of 140-160 [xl of blood was 
10 obtained by puncture of the orbital sinus. Blood was counted on a 

Technicon HIE blood analyzer running software for murine blood. 

Parameters measured were white blood cells, red blood cells, hematocrit, 

hemoglobin, platelets, neutrophils. 

Treatments: Mice were either injected subcutaneously for a bolus 
15 treatment or implanted with 7-day micro-osmotic pumps for continuous 

delivery. Subcutaneous injections were delivered in a volume of 0.2 ml. 

Osmotic pumps were inserted into a subcutaneous incision made in the 

skin between the scapulae of anesthetized mice. Compounds were diluted 

in PBS with 0.1% BSA. All experiments included one control group, 
2 0 labeled "carrier" that were treated with this diluent only. The 

concentration of the test articles in the pumps was adjusted so that the 

calibrated flow rate from the pumps gave the treatment levels indicated in 

the graphs. 

Compounds: A dose titration of the compound was delivered to 
2 5 mice in 7 day micro-osmotic pumps. Mice were treated with various 

compounds at a single dose of 100 Mg/kg in 7 day osmotic pumps. Some 
of the same compounds were then given to mice as a single bolus injection. 

Activity test results: The results of the activity experiments are 
shown in Figures 11 and 12. In dose response assays using 7-day micro- 
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osmotic pumps, the maximum effect was seen with the compound of SEQ 
ID NO: 18 was at 100 jxg/kg/day; the 10 ]ag/kg/day dose was about 50% 
maximally active and 1 fig/kg/ day was the lowest dose at which activity 
could be seen in this assay system. The compound at 10 |Lig/kg/day dose 
5 was about equally active as 100 l^g/kg/ day unpegylated rHu-MGDF in 
the same experiment. 

Example 3 
Fc-EMP fusions 

Fc-EMP . A DNA sequence coding for the Fc region of human IgGl 
10 fused in-frame to a monomer of the EPO-mimetic peptide was constructed 
using standard PCR technology. Templates for PCR reactions were a 
vector containing the Fc sequence (pFc-A3, described in International 
application WO 97/23614, published July 3, 1997) and a synthetic gene 
encoding EPO monomer. The synthetic gene for the monomer was 
15 constructed from the 4 overlapping oligonucleotides (SEQ ID NOS: 390 to 
393, respectively) shown below: 

179 8-2 TAT GAA AGG TGG AGG TGG TGG TGG AGG TAC TTA CTC TTG 
CCA CTT CGG CCC GCT GAC TTG G 

20 

1798-3 CGG TTT GCA AAC CCA AGT CAG CGG GCC GAA GTG GCA AGA 
GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT CAT 

1798-4 GTT TGC AAA CCG CAG GGT GGC GGC GGC GGC GGC GGT GGT 
25 ACC TAT TCC TGT CAT TTT 

1798-5 CCA GGT CAG CGG GCC AAA ATG ACA GGA ATA GGT ACC ACC 
GCC GCC GCC GCC GCC ACC CTG 

3 0 The 4 oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 394 and 395, respectively) shown 
below: 

TATGAAAGGTGGAGGTGGTGGTGGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTG 

35 i +~ ^ + + +- + 60 

TACTTTCCACCTCCACCACCACCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAAC 
b MKGGGGGGGTYSCHFGPLTW- 

GGTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTT 

40 si „__. + — + + — + + — ■ +— 133 

CCAAACGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCGACTGGACC 
b VCKPQGGGGGGGGTYSCHF 

This duplex was amplified in a PCR reaction using 
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1798-18 GCA GAA GAG CCT CTC CCT GTC TCC GGG TAA 

AGG TGG AGG TGG TGG TGG AGG TAG TTA 
CTC T 

5 

and 

1798-19 CTA ATT GGA TCC ACG AGA TTA ACC ACC 

CTG CGG TTT GCA A 

10 

as the sense and antisense primers (SEQ ID NOS: 396 and 397, 
respectively). 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 

15 

1216-52 AAC ATA AGT ACC TGT AGG ATC G 

1798-17 AGA GTA AGT ACC TCC ACC ACC ACC TCC ACC TTT ACC CGG 

AGA CAG GGA GAG GCT CTT CTG C 

20 

which are SEQ ID NOS: 369 and 399, respectively. The oligonucleotides 
1798-17 and 1798-18 contain an overlap of 61 nucleotides, allowing the two 
genes to be fused together in the correct reading frame by combining the 
above PCR products in a third reaction using the outside primers, 1216-52 

25 and 1798-19. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xba l and BamHI, and then ligated 
into the vector pAMG21 (described below), also digested with Xba l and 
BamHI. Ligated DNA was transformed into competent host cells of E. coli 

3 0 strain 2596 (GM221, described herein). Clones were screened for the ability 
to produce the recombinant protein product and to possess the gene 
fusion having the correct nucleotide sequence. A single such clone was 
selected and designated Amgen strain #3718. 

The nucleotide and amino acid sequence of the resulting fusion 

3 5 protein (SEQ ID NOS: 15 and 16) are shown in Figure 13. 

EMP-Fc . A DNA sequence coding for a monomer of the EPO 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
constructed using standard PCR technology. Templates for PCR reactions 
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were the pFC-A3a vector and a synthetic gene encoding EPO monomer. 
The synthetic gene for the monomer was constructed from the 4 
overlapping oligonucleotides 1798-4 and 1798-5 (above) and 1798-6 and 
1798-7 (SEQ ID NOS: 400 and 401, respectively) shown below: 

5 

1798-6 GGC CCG CTG ACC TGG GTA TGT AAG CCA CAA GGG GGT GGG 
GGA GGC GGG GGG TAA TCT CGA G 

1798-7 GAT CCT CGA GAT TAC CCC CCG CCT CCC CCA CCC CCT TGT 
10 GGC TTA CAT AC 

The 4 oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 402 and 403, respectively) shown 
below: 

15 

GTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGC 

1 — + + + + + — -+ 60 

GTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCG 
A VCKPQGGG GGGGGTYSCHFG 

20 

CCGCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGGAGGCGGGGGGTAATCTCGAG 

61 _ + + + — + — + +- 122 

GGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCATTAGAGCTCCTAG 
A PLTWVC.KPQGGGGGGG* 

25 

This duplex was amplified in a PGR reaction using 



1798-21 TTA TTT CAT ATG AAA GGT GGT AAC TAT TCC TGT CAT TTT 

30 and 



1798-22 TGG ACA TGT GTG AGT TTT GTC CCC CCC GCC TCC CCC ACC 

CCC T 

3 5 as the sense and antisense primers (SEQ ID NOS: 404 and 405, 
respectively). 

The Fc portion of the molecule was generated in a PCR reaction 
with pFc-A3 using the primers 



40 1798-23 AGG GGG TGG GGG AGG CGG GGG GGA CAA AAC TCA CAC ATG 

TCC A 

and 

1200-54 GTT ATT GCT CAG CGG TGG CA 

45 

which are SEQ ID NOS: 406 and 407, respectively. The oligonucleotides 
1798-22 and 1798-23 contain an overlap of 43 nucleotides, allowing the two 
genes to be fused together in the correct reading frame by combining the 
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above PCR products in a third reaction using the outside primers, 1787-21 
and 1200-54. 

The final PCR gene product (the full length fusion gene) was 
digested with restriction endonucleases Xbal and BamH I, and then ligated 
5 into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described above. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #3688. 

1 0 The nucleotide and amino acid sequences (SEQ ID NOS: 17 and 18) 

of the resulting fusion protein are shown in Figure 14. 

EMP-EMP-Fc. A DNA sequence coding for a dimer of the EPO- 
mimetic peptide fused in-frame to the Fc region of human IgGl was 
constructed using standard PCR technology. Templates for PCR reactions 

15 were the EMP-Fc plasmid from strain #3688 above and a synthetic gene 
encoding the EPO dimer. The synthetic gene for the dimer was 
constructed from the 8 overlapping oligonucleotides (SEQ ID NOS:408 to 
415, respectively) shown below: 

2 0 1869-23 TTT TTT ATC GAT TTG ATT CTA GAT TTG AGT TTT AAC TTT 

TAG AAG GAG GAA TAA AAT ATG 



25 



40 



1869-48 TAA AAG TTA AAA CTC AAA TCT AGA ATC AAA TCG ATA AAA 

AA 

1871-72 GGA GGT ACT TAC TCT TGC CAC TTC GGC CCG CTG ACT TGG 

GTT TGC AAA CCG 



1871-73 AGT GAG CGG GCC GAA GTG GCA AGA GTA AGT ACC TCC CAT 

3 0 ATT TTA TTC CTC CTT C 

1871-74 CAG GGT GGC GGC GGC GGC GGC GGT GGT ACC TAT TCC TGT 

CAT TTT GGC CCG CTG ACC TGG 

35 1871-7 5 AAA ATG ACA GGA ATA GGT ACC ACC GCC GCC GCC GCC GCC 

ACC CTG CGG TTT GCA AAC CCA 



1871-78 GTA TGT AAG CCA CAA GGG GGT GGG GGA GGC GGG GGG GAC 

AAA ACT CAC ACA TGT CCA 

1871-79 AGT TTT GTC CCC CCC GCC TCC CCC ACC CCC TTG TGG CTT 

ACA TAC CCA GGT CAG CGG GCC 
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The 8 oligonucleotides were annealed to form the duplex encoding an 
amino acid sequence (SEQ ID NOS: 416 and 417, respectively) shown 
below: 



5 T1 1 T TTT ATCGAT T TGATTC T AGAT TTGAGT T T T AAC TOT T AGAAGGAGGAAT AAAAT ATG 

1 + . + — — + + + + 60 

AAAAAATAGCTAAAC TAAGATC TAAACTC AAAATTGAAAATC TTC C TCCTTATTTTATAC 
a M 

10 GGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTGGGTTTGCAAACCGCAGGGTGGC 

61 — +- + - + + -+ + 120 

CCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAACCCAAACGTTTGGCGTCCCACCG 
a GGTY S CHF GPLTWVCKPQGG 

15 GGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGCTGACCTGGGTATGTAAG 

121 +■ + + + + + 180 

CCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTC 
a GG.GGGGTYSCHFGPLTWVCK 

2 0 CCACAAGGGGGTGGGGGAGGCGGGGGGGACAAAACTCACACATGTCCA 

181 +- + — + ■ + 228 

GGTGTTCCCCCACCCCCTCCGCCCCCCCTGTTTTGA 
a PQGGGGGGGDKTHTCP 



2 5 This duplex was amplified in a PCR reaction using 1869-23 and 

1871-79 (shown above) as the sense and antisense primers. 

The Fc portion of the molecule was generated in a PCR reaction 
with strain 3688 DNA using the primers 1798-23 and 1200-54 (shown 
above). 

3 0 The oligonucleotides 1871-79 and 1798-23 contain an overlap of 31 

nucleotides, allowing the two genes to be fused together in the correct 
reading frame by combining the above PCR products in a third reaction 
using the outside primers, 1869-23 and 1200-54. 

The final PCR gene product (the full length fusion gene) was 

3 5 digested with restriction endonucleases Xbal and BamHI, and then ligated 

into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described for Fc-EMP. Clones were screened for ability to 
produce the recombinant protein product and possession of the gene 
fusion having the correct nucleotide sequence. A single such clone was 

4 0 selected and designated Amgen strain #3813. 

The nucleotide and amino acid sequences (SEQ ID NOS: 19 and 20, 
respectively) of the resulting fusion protein are shown in Figure 15. There 
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is a silent mutation at position 145 (A to G, shown in boldface) such that 
the final construct has a different nucleotide sequence than the 
oligonucleotide 1871-72 from which it was derived. 

Fc-EMP-EMP . A DNA sequence coding for the Fc region of human 
5 IgGl fused in-frame to a dimer of the EPO-mimetic peptide was 

constructed using standard PCR technology. Templates for PCR reactions 
were the plasmids from strains 3688 and 3813 above. 

The Fc portion of the molecule was generated in a PCR reaction 
with strain 3688 DNA using the primers 1216-52 and 1798-17 (shown 
1 0 above). The EMP dimer portion of the molecule was the product of a 

second PCR reaction with strain 3813 DNA using the primers 1798-18 (also 
shown above) and SEQ ID NO: 418, shown below: 

179 8-20 CTA ATT GGA TCC TCG AGA TTA ACC CCC TTG TGG CTT ACAT 

15 

The oligonucleotides 1798-17 and 1798-18 contain an overlap of 61 
nucleotides, allowing the two genes to be fused together in the correct 
reading frame by combining the above PCR products in a third reaction 
using the outside primers, 1216-52 and 1798-20. 
2 0 The final PCR gene product (the full length fusion gene) was 

digested with restriction endonucleases Xba l and Bam HI, and then ligated 
into the vector pAMG21 and transformed into competent E. coli strain 
2596 cells as described for Fc-EMP. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 

2 5 having the correct nucleotide sequence. A single such clone was selected 

and designated Amgen strain #3822. 

The nucleotide and amino acid sequences (SEQ ID NOS: 21 and 22, 
respectively) of the fusion protein are shown in Figure 16. 

Characterization of Fc-EMP activity . Characterization was carried 

3 0 out in vivo as follows. 

Mice: Normal female BDF1 approximately 10-12 weeks of age. 



- 117- 



WO 01/83525 



PCT/US01/14310 



Bleed schedule: Ten mice per group treated on day 0, two groups 
started 4 days apart for a total of 20 mice per group. Five mice bled at 
each time point, mice were bled a maximum of three times a week. Mice 
were anesthetized with isoflurane and a total volume of 140-160 ml of 
5 blood was obtained by puncture of the orbital sinus. Blood was counted 
on a Technicon HIE blood analyzer running software for murine blood. 
Parameters measured were WBC, RBC, HCT, HGB, PUT, NEUT, LYMPH. 

Treatments: Mice were either injected subcutaneously for a bolus 
treatment or implanted with 7 day micro-osmotic pumps for continuous 

1 0 delivery. Subcutaneous injections were delivered in a volume of 0.2 ml. 
Osmotic pumps were inserted into a subcutaneous incision made in the 
skin between the scapulae of anesthetized mice. Compounds were diluted 
in PBS with 0.1% BSA. All experiments included one control group, 
labeled "carrier" that were treated with this diluent only. The 

15 concentration of the test articles in the pumps was adjusted so that the 

calibrated flow rate from the pumps gave the treatment levels indicated in 
the graphs. 

Experiments: Various Fc-conjugated EPO mimetic peptides (EMPs) 
were delivered to mice as a single bolus injection at a dose of 100 jug/kg. 
2 0 Fc-EMPs were delivered to mice in 7-day micro-osmotic pumps. The 

pumps were not replaced at the end of 7 days. Mice were bled until day 
51 when HGB and HCT returned to baseline levels. 

Example 4 
TNF~oc inhibitors 

2 5 Fc-TNF-oc inhibitors . A DNA sequence coding for the Fc region of 

human IgGl fused in-frame to a monomer of the TNF-a inhibitory peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 
linker portion of the molecule was generated in a PCR reaction with DNA 
from the Fc-EMP fusion strain #3718 (see Example 3) using the sense 
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primer 1216-52 and the antisense primer 2295-89 (SEQ ID NOS: 369 and 
398 , respectively). The nucleotides encoding the TNF-a inhibitory peptide 
were provided by the PCR primer 2295-89 shown below: 

5 1216-52 AAC ATA AGT ACC TGT AGG ATC G 

22 95-89 CCG CGG ATC CAT TAG GGA CGG TGA CCC AGA GAG GTG TTT TTG TAG 

TGC GGC AGG AAG TCA CCA CCA CCT CCA CCT TTA CCC 

1 0 The oligonucleotide 2295-89 overlaps the glycine linker and Fc portion of 
the template by 22 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Ndel and BamHI, and then ligated into the 

1 5 vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4544. 

2 0 The nucleotide and amino acid sequences (SEQ ID NOS: 1055 and 

1056) of the fusion protein are shown in Figures 19 A and 19B. 

TNF-a inhibitor-Fc . A DNA sequence coding for a TNF-a inhibitory 
peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The template for the PCR reaction was a 

2 5 plasmid containing an unrelated peptide fused via a five glycine linker to 
Fc. The nucleotides encoding the TNF-a inhibitory peptide were 
provided by the sense PCR primer 2295-88, with primer 1200-54 serving as 
the antisense primer (SEQ ID NOS: 1117 and 407, respectively). The 
primer sequences are shown below: 



3 0 



35 



2295-88 GAA TAA CAT ATG GAC TTC CTG CCG CAC TAC AAA AAC ACC TCT CTG GGT 

CAC CGT CCG GGT GGA GGC GGT GGG GAC AAA ACT 



- 119- 



WO 01/83525 



PCT/US01/14310 



1200-54 GTT ATT GCT CAG CGG TGG CA 

The oligonucleotide 2295-88 overlaps the glycine linker and Fc portion of 
the template by 24 nucleotides / with the PGR resulting in the two genes 
5 being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 
with restriction endonucleases Ndel and BamH I, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
1 0 produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4543. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1057 and 1058) of 
the fusion protein are shown in Figures 20A and 20B. 

1 5 Expression in E. coli . Cultures of each of the pAMG21-Fc-fusion 

constructs in E. coli GM221 were grown at 37 °C in Luria Broth medium 
containing 50 mg/ml kanamycin. Induction of gene product expression 
from the luxPR promoter was achieved following the addition of the 
synthetic autoinducer N-(3-oxohexanoyl)~DL-homoserine lactone to the 

2 0 culture media to a final concentration of 20 ng/ml. Cultures were 
incubated at 37 °C for a further 3 hours. After 3 hours, the bacterial 
cultures were examined by microscopy for the presence of inclusion 
bodies and were then collected by centrifugation. Refractile inclusion 
bodies were observed in induced cultures indicating that the Fc-fusions 

2 5 were most likely produced in the insoluble fraction in E. coli . Cell pellets 
were lysed directly by resuspension in Laemmli sample buffer containing 
10% |3-mercaptoethanol and were analyzed by SDS-PAGE. In each case, an 
intense coomassie-stained band of the appropriate molecular weight was 
observed on an SDS-PAGE gel. 
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Purification of Fc-peptide fusion proteins . Cells are broken in water 
(1/10) by high pressure homogenization (2 passes at 14,000 PSI) and 
inclusion bodies are harvested by centrifugation (4200 RPM in J-6B for 1 
hour). Inclusion bodies are solubilized in 6M guanidine, 50mM Tris, 8mM 
5 DTT, pH 8.7 for 1 hour at a 1 /10 ratio. The solubilized mixture is diluted 
20 times into 2M urea, 50 mM tris, 160mM arginine, 3mM cysteine, pH 8.5. 
The mixture is stirred overnight in the cold and then concentrated about 
10 fold by ultafiltration. It is then diluted 3 fold with lOmM Tris, 1.5M 
urea, pH 9. The pH of this mixture is then adjusted to pH 5 with acetic 

1 0 acid. The precipitate is removed by centrifugation and the supernatant is 
loaded onto a SP-Sepharose Fast Flow column equilibrated in 20mM 
NaAc, 100 mM NaCl, pH 5 (lOmg/ml protein load, room temperature). 
The protein is eluted from the column using a 20 column volume gradient 
in the same buffer ranging from lOOmM NaCl to 500mM NaCl. The pool 

1 5 from the column is diluted 3 fold and loaded onto a SP-Sepharose HP 

column in 20mM NaAc, 150mM NaCl, pH 5(10mg/ml protein load, room 
temperature). The protein is eluted using a 20 column volume gradient in 
the same buffer ranging from 150mM NaCl to 400mM NaCl. The peak is 
pooled and filtered. 

2 0 Characterization of activity of Fc-TNF-a inhibitor and TNF-a 

inhibitor -Fc . Binding of these peptide fusion proteins to TNF- a can be 
characterized by BIAcore by methods available to one of ordinary skill in 
the art who is armed with the teachings of the present specification. 

Example 5 

2 5 IL-1 Antagonists 

Fc~IL-l antagonist . A DNA sequence coding for the Fc region of 
human IgGl fused in-frame to a monomer of an IL-1 antagonist peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 
linker portion of the molecule was generated in a PCR reaction with DNA 
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10 



from the Fc-EMP fusion strain #3718 (see Example 3) using the sense 
primer 1216-52 and the antisense primer 2269-70 (SEQ ID NOS: 369 and 
1118, respectively). The nucleotides encoding the IL-1 antagonist peptide 
were provided by the PCR primer 2269-70 shown below: 

1216-52 AAC ATA AGT ACC TGT AGG ATC G 

22 69-70 CCG CGG ATC CAT TAC AGC GGC AGA GCG TAC GGC TGC CAG TAA CCC 

GGG GTC CAT TCG AAA CCA CCA CCT CCA CCT TTA CCC 



The oligonucleotide 2269-70 overlaps the glycine linker and Fc portion of 
the template by 22 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 

15 The PCR gene product (the full length fusion gene) was digested 

with restriction endonucleases Ndel and BamH I, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 

2 0 having the correct nucleotide sequence. A single such clone was selected 
and designated Amgen strain #4506. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1059 and 
1060) of the fusion protein are shown in Figures 21 A and 21B. 

IL-1 antagonist-Fc . A DNA sequence coding for an IL-1 antagonist 

2 5 peptide fused in-frame to the Fc region of human IgGl was constructed 

using standard PCR technology. The template for the PCR reaction was a 
plasmid containing an unrelated peptide fused via a five glycine linker to 
Fc. The nucleotides encoding the IL-1 antagonist peptide were provided 
by the sense PCR primer 2269-69, with primer 1200-54 serving as the 

3 0 antisense primer (SEQ ID NOS: 1119 and 407, respectively). The primer 

sequences are shown below: 
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2269-69 GAA TAA CAT ATG TTC GAA TGG ACC CCG GGT TAC TGG CAG CCG TAG GCT 

CTG CCG CTG GGT GGA GGC GGT GGG GAC AAA ACT 

1200-54 GTT ATT GCT CAG CGG TGG CA 

5 

The oligonucleotide 2269-69 overlaps the glycine linker and Fc portion of 
the template by 24 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 

1 0 with restriction endonucleases Ndel and BamHI, and then ligated into the 
vector p AMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 

1 5 and designated Amgen strain #4505. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1061 and 
1062) of the fusion protein are shown in Figures 22A and 22B. Expression 
and purification were carried out as in previous examples. 

Characterization of Fc-IL-1 antagonist peptide and IL-1 antagonist 

2 0 peptide-Fc activity . IL-1 Receptor Binding competition between IL-lfJ, IL- 
1RA and Fc-conjugated IL-1 peptide sequences was carried out using the 
IGEN system. Reactions contained 0.4 nM biotin-IL-lR + 15 nM IL-l-TAG 
+ 3 uM competitor + 20 ug/ml streptavidin-conjugate beads, where 
competitors were IL-1RA, Fc-IL-1 antagonist, IL-1 antagonist-Fc). 

2 5 Competition was assayed over a range of competitor concentrations from 
3 uM to 1.5 pM. The results are shown in Table C below; 
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Table C — Results from IL-1 Receptor Binding Competition Assay 



IL-1pep-Fc Fc«lL-1pep IL-1ra 

5 Kl 281.5 59.58 1.405 

EC50 530.0 112.2 2.645 

95% Confidence Intervals 

10 EC50 280.2 to 1002 54.75 to 229.8 1.149 to 

6.086 



15 



Kl 148.9 to 532.5 29.08 to 122.1 0.6106 to 

3.233 

Goodness of Fit 

R 2 0.9790 0.9687 0.9602 

2 0 Example 6 

VEGF-Antagonists 

Fc-VEGF Antagonist . A DNA sequence coding for the Fc region of 
human IgGl fused in-frame to a monomer of the VEGF mimetic peptide 
was constructed using standard PCR technology. The templates for the 

2 5 PCR reaction were the pFc-A3 plasmid and a synthetic VEGF mimetic 

peptide gene. The synthetic gene was assembled by annealing the 
following two oligonucleotides primer (SEQ ID NOS: 1120 and 1121, 
respectively): 

2293-11 GTT GAA CCG AAC TGT GAG ATC CAT GTT ATG TGG GAA TGG GAA 

3 0 TGT TTT GAA CGT CTG 

2293-12 GAG ACG TTC AAA ACA TTC CCA TTC CCA CAT AAC ATG GAT GTC 

ACA GTT CGG TTC AAC 

3 5 The two oligonucleotides anneal to form the following duplex encoding 
an amino acid sequence shown below (SEQ ID NOS 1122 and 1133): 



GTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGTCTG 

40 1 - + + + — +' + 57 

CAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCAGAC 
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5 This duplex was amplified in a PCR reaction using 2293-05 and 2293-06 as 
the sense and antisense primers (SEQ ID NOS. 1125 and 1126). 

The Fc portion of the molecule was generated in a PCR reaction 
with the pFc-A3 plasmid using the primers 2293-03 and 2293-04 as the 
sense and antisense primers (SEQ ID NOS. 1123 and 1124, respectively). 
10 The full length fusion gene was obtained from a third PCR reaction using 
the outside primers 2293-03 and 2293-06. These primers are shown below: 



15 



2293-03 ATT TGA TTC TAG AAG GAG GAA TAA CAT ATG GAC AAA ACT CAC 

ACA TGT 

22 93-04 GTC ACA GTT CGG TTC AAC ACC ACC ACC ACC ACC TTT ACC CGG 

AGA CAG GGA 



22 93-05 TCC CTG TCT CCG GGT AAA GGT GGT GGT GGT GGT GTT GAA CCG 

2 0 AAC TGT GAC ATC 

2293-06 CCG CGG ATC CTC GAG TTA CAG ACG TTC AAA ACA TTC CCA 

The PCR gene product (the full length fusion gene) was digested 

2 5 with restriction endonucleases Ndel and Bam HI, and then ligated into the 

vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 

3 0 and designated Amgen strain #4523. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1063 and 
1064) of the fusion protein are shown in Figures 23A and 23B. 
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VEGF antagonist -Fc . A DNA sequence coding for a VEGF mimetic 
peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The templates for the PCR reaction were 
the pFc~A3 plasmid and the synthetic VEGF mimetic peptide gene 
5 described above. The synthetic duplex was amplified in a PCR reaction 
using 2293-07 and 2293-08 as the sense and antisense primers (SEQ ID 
NOS. 1127 and 1128, respectively). 

The Fc portion of the molecule was generated in a PCR reaction 
with the pFc-A3 plasmid using the primers 2293-09 and 2293-10 as the 
10 sense and antisense primers (SEQ ID NOS. 1129 and 1130, respectively). 
The full length fusion gene was obtained from a third PCR reaction using 
the outside primers 2293-07 and 2293-10. These primers are shown below: 

2293-07 ATT TGA TTC TAG AAG GAG GAA TAA CAT ATG GTT GAA CCG AAC 

15 TGT GAC 

2233-08 ACA TGT GTG AGT TTT GTC ACC ACC ACC ACC ACC CAG ACG TTC 

AAA ACA TTC 

20 2293-09 GAA TGT TTT GAA CGT CTG GGT GGT GGT GGT GGT GAC AAA ACT 

CAC ACA TGT 

2293-10 CCG CGG ATC CTC GAG TTA TTT ACC CGG AGA CAG GGA GAG 

The PCR gene product (the full length fusion gene) was digested 

2 5 with restriction endonucleases Ndel and BamHI, and then ligated into the 

vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 

3 0 and designated Amgen strain #4524. 
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The nucleotide and amino acid sequences (SEQ ID NOS: 1065 and 
1066) of the fusion protein are shown in Figures 24 A and 24B. Expression 
and purification were carried out as in previous examples. 

5 Example 7 

MMP Inhibitors 

Fc-MMP inhibitor . A DNA sequence coding for the Fc region of 
human IgGl fused in-frame to a monomer of an MMP inhibitory peptide 
was constructed using standard PCR technology. The Fc and 5 glycine 
1 0 linker portion of the molecule was generated in a PCR reaction with DNA 
from the Fc-TNF-a inhibitor fusion strain #4544 (see Example 4) using the 
sense primer 1216-52 and the antisense primer 2308-67 (SEQ ID NOS: 369 
and 1131, respectively). The nucleotides encoding the MMP inhibitor 
peptide were provided by the PCR primer 2308-67 shown below: 



15 



20 



1216-52 AAC ATA AGT ACC TGT AGG ATC G 

23 08-67 CCG CGG ATC CAT TAG CAC AGG GTG AAA CCC CAG TGG GTG GTG 

CAA CCA CCA CCT CCA CCT TTA CCC 



The oligonucleotide 2308-67 overlaps the glycine linker and Fc portion of 
the template by 22 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 

2 5 with restriction endonucleases Ndel and BamH I, and then ligated into the 

vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 

3 0 and designated Amgen strain #4597. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1067 and 
1068) of the fusion protein are shown in Figures 25A and 25B. Expression 
and purification were carried out as in previous examples. 
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MMP Inhibitor-Fc . A DNA sequence coding for an MMP inhibitory 
peptide fused in-frame to the Fc region of human IgGl was constructed 
using standard PCR technology. The Fc and 5 glycine linker portion of the 
molecule was generated in a PCR reaction with DNA from the Fc-TNF-a 
5 inhibitor fusion strain #4543 (see Example 4). The nucleotides encoding 
the MMP inhibitory peptide were provided by the sense PCR primer 2308- 
66, with primer 1200-54 serving as the antisense primer (SEQ ID NOS: 
1132 and 407, respectively). The primer sequences are shown below: 

10 

230 8-66 GAA TAA CAT ATG TGC ACC ACC CAC TGG GGT TTC ACC CTG TGC 

GGT GGA GGC GGT GGG GAC AAA 

1200-54 GTT ATT GCT CAG CGG TGG CA 

15 

The oligonucleotide 2269-69 overlaps the glycine linker and Fc portion of 
the template by 24 nucleotides, with the PCR resulting in the two genes 
being fused together in the correct reading frame. 

The PCR gene product (the full length fusion gene) was digested 

2 0 with restriction endonucleases Ndel and BamHI, and then ligated into the 
vector pAMG21 and transformed into competent E. coli strain 2596 cells as 
described for EMP-Fc herein. Clones were screened for the ability to 
produce the recombinant protein product and to possess the gene fusion 
having the correct nucleotide sequence. A single such clone was selected 

2 5 and designated Amgen strain #4598. 

The nucleotide and amino acid sequences (SEQ ID NOS: 1069 and 
1070) of the fusion protein are shown in Figures 26 A and 26B. 

* * * 

The invention now being fully described, it will be apparent to one 
30 of ordinary skill in the art that many changes and modifications can be 

made thereto, without departing from the spirit and scope of the invention 
as set forth herein. 
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Abbreviations 

Abbreviations used throughout this specification are as defined 



below, unless otherwise defined in specific circumstances. 



5 


Ac 


acetyl (used to refer to acetylated residues) 




AcBpa 


acetylated p-benzoyl-L-phenylalanine 




ADCC 


antibody-dependent cellular cytotoxicity 




Aib 


arninoisobutyric acid 




bA 


beta-alanine 


10 


Bpa 


p-benzoyl-L-phenylalanine 




BrAc 


bromoacetyl (BrCH^O) 




BSA 


Bovine serum albumin 




Bzl 


Benzyl 




Cap 


Caproic acid 


15 


CTL 


Cytotoxic T lymphocytes 




CTLA4 


Cytotoxic T lymphocyte antigen 4 




DARC 


Duffy blood group antigen receptor 




DCC 


Dicylcohexylcarbodiimide 




Dde 


l-(4 / 4-dimethyl-2,6-dioxo-cyclohexylidene)ethyl 


20 


EMP 


Ervtrdropoietin-rrdmetic peptide 

J X XX 




ESI-MS 


Electron spray ionization mass spectrometry 




EPO 


Erythropoietin 




Fmoc 


fluorenylmethoxycarbonyl 




G-CSF 


Granulocyte colony stimulating factor 


25 


GH 


Growth hormone 




HCT 


hematocrit 




HGB 


hemoglobin 




hGH 


Human growth hormone 




HOBt 


1-Hydroxybenzotriazole 
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HPLC 


high performance liquid chromatography 




IL 


interleukin 




IL-R 


interleukin receptor 




IL-1R 


interleukin~l receptor 


5 


IL-lra 


interleukin-1 receptor antagonist 




Lau 


Laurie acid 




LPS 


lipopolysaccharide 




LYMPH 


lymphocytes 




MALDI-MS 


Matrix-assisted laser desorption ionization mass 


10 




spectrometry 




Me 


methyl 




MeO 


methoxy 




MHC 


major histocompatibility complex 




MMP 


matrix metalloproteinase 


15 


MMPI 


matrix metalloproteinase inhibitor 




1-Nap 


1-napthylalanine 




NEUT 


neutrophils 




NGF 


nerve growth factor 




Me 


norleucine 


20 


NMP 


N-methyl-2-pyrrolidinone 




PAGE 


polyacrylamide gel electrophoresis 




PBS 


Phosphate-buffered saline 




Pbf 


2 / 2 / 4/6,7-pendamethyldihydrobenzofuran-5-sulfonyl 




PCR 


polymerase chain reaction 


25 


Pec 


pipecolic acid 




PEG 


Poly(ethylene glycol) 




pGlu 


pyroglutamic acid 




Pic 


picolinic acid 




PLT 


platelets 
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pY 


phosphotyrosine 




RBC 


red blood cells 




RBS 


ribosome binding site 




RT 


room temperature (25 °C) 


5 


Sar 


sarcosine 




SDS 


sodium dodecyl sulfate 




STK 


serine-threonine kinases 




t-Boc 


tert-Butoxycarbonyl 




tBu 


tert-Butyl 


10 


TGF 


tissue growth factor 




THF 


thymic humoral factor 




TK 


tyrosine kinase 




TMP 


Thrombopoietin-rnimetic peptide 




TNF 


Tissue necrosis factor 


15 


TPO 


Thrombopoietin 




TRAIL 


TNF-related apoptosis-inducing ligand 




Trt 


trityl 




UK 


urokinase 




UKR 


urokinase receptor 


20 


VEGF 


vascular endothelial cell growth factor 




VIP 


vasoactive intestinal peptide 




WBC 


white blood cells 
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What is claimed is: 

1. Composition of matter claimsA composition of matter of the formula 

(X 1 ) a -F 1 -(X 2 ) b 

and multimers thereof, wherein: 
5 F 1 is an Fc domain; 

X 1 and X 2 are each independently selected from -(L^-P 1 , -(L^-P 1 - 
(L 2 ) d -P\ -(LVP^aV^-CL 3 )^ and -(LVPW^l -F 3 -(V) f F> 

P 1 , P 2 , P 3 , and P 4 are each independently sequences of 
pharmacologically active peptides; 
1 0 L 1 , L 2 , L 3 / and L 4 are each independently linkers; and 

a, b, c, d, e, and f are each independently 0 or 1, provided that at 
least one of a and b is 1. 

2. The composition of matter of Claim 1 of the formulae 

X 1 -F 1 

15 or 

F 1 -X 2 . 

3. The composition of matter of Claim 1 of the formula 

F 1 -(L 1 ) C -P 1 . 

4. The composition of matter of Claim 1 of the formula 

20 ^-(LVPML-VP 2 

5. The composition of matter of Claim 1 wherein F 1 is an IgG Fc domain. 

6. The composition of matter of Claim 1 wherein F 1 is an IgGl Fc domain. 

7. The composition of matter of Claim 1 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

2 5 8. Claims specific to IL-1 lead compoundsThe composition of matter of 
Claim 1 wherein X 1 and X 2 comprise an IL-1 antagonist peptide 
sequence. 
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9. The composition of matter of Claim 8 wherein the IL-1 antagonist 
peptide sequence is selected from SEQ ID NOS: 212, 907, 908, 909, 910, 
917, and 979. 

10. The composition of matter of Claim 8 wherein the IL-1 antagonist 

5 peptide sequence is selected from SEQ ID NOS: 213 to 271, 671 to 906, 

911 to 916, and 918 to 1023. 

11. The composition of matter of Claim 8 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

12. Claims specific to EPO lead compoundsThe composition of matter of 
1 0 Claim 1 wherein X 1 and X 2 comprise an EPO-mimetic peptide sequence. 

13. The composition of matter of Claim 12 wherein the EPO-mimetic 
peptide sequence is selected from Table 5. 

14. The composition of matter of Claim 12 wherein P 1 comprises the 
sequence of SEQ ID NO: 2. 

15 15. The composition of matter of Claim 12 comprising a sequence selected 
from SEQ ID NOS: 83, 84, 85, 124, 419, 420, 421, and 461. . 

16. The composition of matter of claim 12 comprising a sequence selected 
from SEQ ID NOS: 339 and 340. 

17. The composition of matter of Claim 12 comprising a sequence selected 
2 0 from SEQ ID NOS: 20 and 22. 

18. Claims specific to TPO lead compounds not covered by Bob Cook's 
caseThe composition of matter of Claim 3 wherein P 1 is a TPO-mimetic 
peptide sequence. 

19. The composition of matter of Claim 18 wherein P 1 is a TPO-mimetic 
2 5 peptide sequence selected from Table 6. 

20. The composition of matter of Claim 18 wherein F 1 comprises the 
sequence of SEQ ID NO: 2. 

21. The composition of matter of Claim 18, having a sequence selected from 
SEQ ID NOS: 6 and 12. 
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22. DNA, vector, host cell claims A DNA encoding a composition of 
matter of any of Claims 1 to 21. 

23. An expression vector comprising the DNA of Claim 22. 

24. A host cell comprising the expression vector of Claim 23. 
5 25. The cell of Claim 24, wherein the cell is an E. coli cell. 

26. General process claimsA process for preparing a pharmacologically 
active compound, which comprises 

a. selecting at least one randomized peptide that modulates the 
activity of a protein of interest; and 
10 b. preparing a pharmacologic agent comprising at least one Fc domain 

covalently linked to at least one amino acid sequence of the selected 
peptide or peptides. 

27. The process of Claim 26, wherein the peptide is selected in a process 
comprising one or more techniques selected from yeast-based 

15 screening, rational design, protein structural analysis, or screening of a 

phage display library, an E. coli display library, a ribosomal library, or 
a chemical peptide library. 

28. The process of Claim 26, wherein the preparation of the pharmacologic 
agent is carried out by: 

2 0 a. preparing a gene construct comprising a nucleic acid sequence 

encoding the selected peptide and a nucleic acid sequence encoding 
an Fc domain; and 
b. expressing the gene construct. 

29. The process of Claim 26, wherein the gene construct is expressed in an 
25 E. coli cell. 

30. The process of Claim 26, wherein the protein of interest is a cell surface 
receptor. 

31. The process of Claim 26, wherein the protein of interest has a linear 
epitope. 
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32. The process of Claim 26, wherein the protein of interest is a cytokine 
receptor. 

33. Claims specific to peptidesThe process of Claim 26, wherein the 
peptide is an EPO-mimetic peptide. 

5 34. The process of Claim 26, wherein the peptide is a TPO-mimetic 
peptide. 

35. The process of Claim 26, wherein the peptide is an IL-1 antagonist 
peptide. 

36. The process of Claim 26, wherein the protein of interest is selected 
1 0 from the TNF family. 

37. The process of Claim 26, wherein the peptide is a TNF-antagonist 
peptide. 

38. The process of Claim 26, wherein the peptide is a CTLA4-mimetic 
peptide. 

15 39. The process of Claim 26, wherein the peptide is selected from Tables 4 
to 20. 

40. The process of Claim 26, wherein the selection of the peptide is carried 
out by a process comprising: 

a. preparing a gene construct comprising a nucleic acid sequence 
2 0 encoding a first selected peptide and a nucleic acid sequence 

encoding an Fc domain; 

b. conducting a polymerase chain reaction using the gene construct 
and mutagenic primers, wherein 

i) a first mutagenic primer comprises a nucleic acid sequence 

2 5 complementary to a sequence at or near the 5' end of a coding 

strand of the gene construct, and 

ii) a second mutagenic primer comprises a nucleic acid sequence 
complementary to the 3' end of the noncoding strand of the 
gene construct. 
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41. The process of Claim 26, wherein the compound is derivatized. 

42. The process of Claim 26, wherein the derivatized compound comprises 
a cyclic portion, a cross-linking site, a non-peptidyl linkage, an N- 
terminal replacement, a C-terminal replacement, or a modified amino 

5 acid moiety. 

43. Claims specifying the Fc domainThe process of Claim 26 wherein the 
Fc domain is an IgG Fc domain. 

44. The process of Claim 26, wherein the vehicle is an IgGl Fc domain. 

45. The process of Claim 26, wherein the vehicle comprises the sequence of 
10 SEQIDNO:2. 

46. Claims to process reciting specific structureThe process of Claim 26, 
wherein the compound prepared is of the formula 

(X\-F 1 -(X\ 
and multimers thereof, wherein: 
1 5 F 1 is an Fc domain; 

X 1 and X 2 are each independently selected from -(V)-P\ -(L^-P 1 - 
(L 2 ) d -P 2 , -(L VP'KL V^-CLVP 9 / and -(L 1 ) c -P 1 -(L 2 ) d -P 2 -(L 3 ) e -P 3 -(L 4 ) r P 4 

P 1 , P 2 , P 3 , and P 4 are each independently sequences of 
pharmacologically active peptides; 
2 0 L 1 , L 2 , L 3 , and L 4 are each independently linkers; and 

a, b, c, d, e, and f are each independently 0 or 1, provided that at 
least one of a and b is 1. 

47. The process of Claim 46, wherein the compound prepared is of the 
formulae 

25 X 1 -F 1 
or 

F 1 -X 2 . 

48. The process of Claim 46, wherein the compound prepared is of the 
formulae 
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F 1 -(L 1 ) C -P 1 

or 

F 1 -(L 1 ) c -P 1 -(L 2 ) d -P 2 . 

49. Claims specifying the Fc domainThe process of Claim 46, wherein F 1 is 
5 an IgG Fc domain. 

50. The process of Claim 46, wherein F 1 is an IgGl Fc domain. 

51. The process of Claim 46, wherein F 1 comprises the sequence of SEQ ID 
NO: 2. 

52. Claims specific to isotope and toxin conjugated moleculesThe 

10 composition of matter of Claim 1, further comprising an effector 

molecule or domain selected from a group consisting of: 

a. radioisotopes; 

b. ricin A toxin; 

c. microbially derived toxins; 
15 d. biotin; 

e. streptavidin; and 

f. cytotoxic agents. 

53. The composition of matter of Claim 52, wherein the vehicle is an Fc 
domain. 

2 0 54. The composition of matter of Claim 52, wherein at least one 

pharmacologically active peptide is capable of binding a tumor-specific 
epitope. 

55, The composition of matter of Claim 52, wherein the effector molecule is 
a radioisotope. 

2 5 56. The composition of matter of Claim 55, wherein the radioisotope is 
selected from 90 Yttrium, 131 Iodine, ^Actinium, and 213 Bismuth. 
57. A process for preparing a composition of matter, which comprises: 
a. selecting at least one randomized peptide that specifically binds to a 
target epitope; and 
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b. preparing a pharmacologic agent comprising (i) at least one vehicle, 
(ii) at least one amino acid sequence of the selected peptide or 
peptides, and (iii) an effector molecule. 
58. The process of Claim 57, wherein the vehicle is an Fc domain. 
5 59. The process of Claim 57, wherein the target epitope is a tumor-specific 



60. The process of Claim 57, wherein the effector molecule is selected from: 

a. radioisotopes; 

b. ricin A toxin; 



epitope. 



10 



e. 



c. 



d. 



microbially derived toxins; 
biotin; 

streptavidin; and 



15 



f . cytotoxic agents. 

61. The process of Claim 60, wherein the effector molecule is a 
radioisotope. 

62. The process of Claim 61, wherein the radioisotope is selected from 



90- 



'Yttrium, 131 Iodine, ^Actinium, and 213 Bismuth. 
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FIGURE 1 
peptide selection 

i 

peptide optimization 

i 

formation of Fopeptide DNA construct 

i 

insertion of construct into expression vector 

i 

transfection of host cell with vector 

i 

expression of vector in host cell 

i 

Fc muitimer formation in host cell 

i 

isolation of Fc muitimer from host cell 
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FIGURE 2 
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FIGURE 3 
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FIGURE 4 

ATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCA 

T AC C TGT T T TGAGTGT GT AC AGGTGG AAC AGGTC GAGGC C T TGAGGAC C C C CC TGGC AGT 

a MDKTHTCPPCPAPELLGGPS 

GTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTC 

61 + 4- -4- — -4- — + 4- 120 

CAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAG 

a VF L FP PKPKDTLMISRTPEV 

ACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTG 

121 — — + + — 4- — + 4- + 180 

TGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCAC 

a TCVVVDVSHEDPEVKFNWYV 

GACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACG 
CTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGC 

a DGVEVHNAKTKPREEQYNST 

TACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTAC 

241 4- -4- — + — + 4- — 4- 300 

ATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATG 

a YRVVSVLTVLHQDWLNGKEY 

AAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCC 

301 -+ — — 4- — 4- 4- 4- + 360 

TTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGG 

a KCKVSNKALPAPIEKT I SKA 

AAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACC 

361 ■ 4- 4- + ■ -4- 4- : 4- 420 

TTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGG 

a KGQPREPQVYTLPPSRDELT 

AAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTG 

421 — 4- -4- 4- -4- + — ■■ = — + 480 

TTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCAC 

a KNQVSLTCLVKGFYPSD I A V 

GAGTGGGAGAGC AATGGGCAGCCGGAGAACAACTACAAGACC ACGC C TC C CGTGCTGGAC 

481 4- 4- 4- — + 4- — + 540 

CTCACCGTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTG 

a EWESNGQPENNYKTTPPVLD 

TCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAG 

541 4- 4- +■ 4- 4- + 600 

AGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTC 

a SDGSFFLY SK LTVDKSRWQ Q 

GGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAG 

601 + 4- + — +- 4-- — + 660 

CCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTC 

a GNVFSCSVMHEALHNHYTQK 

AGCCTCTCCCTGTCTCCGGGTAAA 

661 + 4- 684 

TCGGAGAGGGACAGAGGCCCATTT 

a SLSLSPGK 
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FIGURE 5 
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FIGURE 6 
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FIGURE 7 

xbai 
I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACA 

l -4- + " + + + - + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHT C P - 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 . + + ■ 4- + 4- 4- 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

12X ■■ 4- ■■ — - + ■ + — ■ + ■ = + 4- 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMISRTPEVTCVVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 4- 4- + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
H EDPEVK. F NVtfYVDGVEVH N A - 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 +— ; + 4- + ~ -4- 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREE Q Y N S TYRVVSV L T - 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 ■■ + --4- 4- --- + + 4- 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
VLHQDWLNGK. EYK.CKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTG 
LPAPIEKT I SKAKGQPRE F Q - 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 ■ ■ + < ^ 4- — -4- . + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLP PSRDELTKNQVSLT C - 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 — — -+■■ + ~ ■ — 4- — ■ 4- ~~+ - — ■ — < — + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGFYP S DIAVEWESNGQ P - 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 -- 4- 4- +-- : + -4- -4- 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
ENNYKTT P PVLDSDGSFF LY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 — — +- ---4- + — . . + - + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKS RWQQGNVF SC S V ~ 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 ~ + — . +■ + + ~+ 4- 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQKSL SLSPGK- 

AAGGTGGAGGTGGTGGTATCGAAGGTCCGACTGTGCGTCAGTGGCTGGCTGCTCGTGCTT 

721 4- - + — 4- -+ --4- -4- 780 

TTCCACCTCCACCACCATAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAA 
GGGGGI EG P T LRQWLAARA * - 

BamHI 
I 

AATCTCGAGGATCC 

781 — + 794 

TTAGAGCTCCTAGG 
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FIGURE 8 

Xbal 

] 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 ~ — + + 4-- 4- + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTCP- 



CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
c PCPAPELLGGPSVFLFPPKP- 



CGAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 — +- + -4- — 4- + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
c KDTLMISRTPEVTCVVVDVS- 



GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

. + + : + ■ + + : 4~ 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
c HEDPEVK F NWYVDGVEVHNA- 



CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 +—■■ + — — — + + 4- — + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
c KTKPREE Q YNS TYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 ~ 4- + - - + 4- 4- 4- 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
c VLHQDWLNGKEYKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 - - 4- — + + ~ — + — - 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTG 
c L P A P I EKT I SKAKGQPRBP Q - 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 - ~4— - + : 4" : + : + - + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
c VYTLP PSR DELTKNQVS LT C - 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 : 4- : + ~ : — + : + 4" + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
C LVKGFYP S DIAVEWESNGQ P - 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 : + + - _ : 4- : 4- = 4- 4- 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
c ENNYKTTPPVLDSDGSFFLY- 



ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 + . + : 4- . — 4- + — 4- 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKS RWQQGNVF SC S V - 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 -+ 4- ^+ : + -~-+- 4- 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYT QKSLSLSPGK- 

AAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTG 

721 . 4- 4- + -4- 4- — — 4- 780 

TTCCACCTCCACCACCATAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAC 
GGGGG I EG PTLRQWLAARAG- 

GTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCAC 

781 4- + - + + ~ + 4- 840 

CACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTG 
GGGGGGGI EGPTLRQWLAAR- 



BamHI 

I 

GCGCATAATCTCGAGGATCCG 

841 — . 4- - + - 861 

CG CGT ATT AG AGCTC CT AGGC 

c 
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FIGURE 9 

Xbal 
1 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGATCGAAGGTCCGACTCTGC 

X — + + — : — + ~- + : + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIEGPTLR- 

GTCAGTGGCTGGCTGCTCGTGCTGGCGGTGGTGGCGGAGGGGGTGGCATTGAGGGCCCAA 

61 = + 4- : 4 + : + 120 

CAGTCACCGACCGACGAGCACGACCGCCACCACCGCCTCCCCCACCGTAACTCCCGGGTT 
QWLAARAGGGGGGGGIEG P T - 

CCCTTCGCCAATGGCTTGCAGCACGCGCAGGGGGAGGCGGTGGGGACAAAACTCACACAT 

+ + + : + : + + 180 

GGGAAGCGGTTACCGAACGTCGTGCGCGTCCCCCTCCGCCACCCCTGTTTTGAGTGTGTA 
LRQWLAARAGGGGGDKTHTC- 

GTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAA 

181 + — -+ 4 + + 240 

CAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTT 
PPCPAPELLGGPSVFLFP P K - 

AACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACG 

241 : . . + : + ■ _ + + +~ ■: 4 300 

TTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGC 
PKDTLMI SRTPEVTCVVVDV- 

TGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATA 

301 + + ~+ ■ + . 4- ■. — -+ 360 

ACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTAT 
SHE D PE VKFNWYVDGVEVHN- 

ATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCC 

361 ; + - + , -+ -4 -4 — 4 420 

TACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGG 
AKTKPRE EQYNSTYRVVS V L - 

TCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACA 

421 + . + — . + - — + + -+ 480 

AGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGT 
TVLHQD W LNGKEYKC KVS N K - 

AAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAAC 

481 + — ■ - + — + + + 4 540 

TTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTG 
ALPAPIEKTISKAKGQPREP- 

CACAGGTGTACACCCTGCCGCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGA 

541 +■ + + ■. +~^~ + + 600 

GTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACT 
QVYTLP P SRDELTKNQVS L T - 

CCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGC 

601 + + 4 + ■ — +- 4- 660 

GGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCAGCTCACCCTCTCGTTACCCG 
CliVKGFY PSDIAVEWE SNG Q - 

AGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCC 

661 4 ■■ + ■■ 4- - — 4- 4 720 

TCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGG 
PENNYKTTPPVLDSDGSFF L - 

TCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCT 

721 ■ + -4- ■ + - 4 ■■ + ■ + 780 

AGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGA 
YSKLTVDKSRWQQG3XTVFSC S - 

CCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 

781 . —4- +• -~ + ~ ^4 4- 4- 840 

GGGACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCC 
VMHEALHNHYTQKSLSLSPG- 

BamHI 
i 

GTAAATAATGGATCC 

S41 + — 855 

CATTTATTACCTAGG 
K * 
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FIGURE 10 

Xbal 
I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGATCGAAGGTCCGACTCTGC 

l + + : + ■ + J" ■ 4- 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIEGPTLR- 

GTCAGTGGCTGGCTGCTCGTGCTGGTGGAGGCGGTGGGGACAAAACTCACACATGTCCAC 

61 + + — + 4- 4- 4- 120 

C AGTC ACC G AC CG ACG AGC ACG AC C AC CT C CGC C AC CCCTGTTTTG AGTGTGT AC AGGTG 
QWLAARAGGGGGDKTHTCPP- 

CTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCA 

121 + + + -+ ■ ■■ — + 4- 180 

GAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGT 
C PAPELLGGP SVFLF PPK P K - 

AGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCC 

181 : + : ~ 4--- -+ 4- ■ + ~ ■ 4- 240 

TCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGG 
DTLMISRTPEVTCVVVDVSH- 

ACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCA 

241 -+ - + -- 4- + - — • 4- = 4- 300 

TGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGT 
EDPEVKFNWYVDGVEV HNAK- 

AGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCG 

301 — — + = 4- ■■ 4- -= — 4- + 4- 360 

TCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGC 
TKPREE QYNS TYRVVSVL TV- 

TCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCC 

361 — ■ ■■ + — ■■ + 4-- -+ 4- — ■■ + 420 

AGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGG 
L H Q DWLNGKE YK C KVSNKAL- 

TCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGG 

421 , + , — 4- 4- ■■ — ■ 4- 4— - ■■ — 4- 480 

AGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCC 
PAPIEKT1 SKAKGQPREP Q V - 

TGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCC 

481 : 4- 4- + : 4- +~ : 540 

ACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGG 
YTLPPSRDELTKNQVSLTCL- 

TGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGG 

541 4- --4- 4- 4- + : — ; 4- 600 

ACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCC 
VKGFYPSDIAVEWESNGQPE- 

AGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACA 

,601 + - + ■■ + ■ - + — ■ + 660 

TCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGT 
NNYKTTPPVLDSDGSFF LYS- 

GCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGA 

661 ■ + = 4- 4- 4— 4— ■ + 720 

CGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACT 
K LTVDKSR WQ QGNVF SCSVM- 

TGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAAT 

721 + + + - + + + 780 

ACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTA 
HEALH NHYTQKSLSLSPGK * - 

BamHI 
i 

AATGGATCC 

781 789 

TTACCTAGG 
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FIGURE 13 

Xbai 



X - : + , + + _ + ~ + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 
C MDKTHTCP- 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 
6 1 + + _. + + : + 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
c PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 > + — ■■ + + ■ + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
c KDTLMISRTPEVTCVVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 ~ ■. + . + + + . + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
c HEDPEVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 ■ + — + ■ 4- ■■ — . + -+ 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
c K T K P R E E QYNS TYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 +- + , - + — + -_, _^+_ -+ 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
c VLHQDWLNGKE YKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + ■■ + + ■. -f + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTG 
c L PAPIEKTI SKAKGQ PREPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 ■ -+ + + - + + — + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
c VYTLPPSRDELTKNQVSLTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 ~ ■■ + + +- — . ■ + — — + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
C LVKGFYP S DIAVEWE S NGQ P ~ 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 4- +■ + + + — + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
C ENNY'KTT P PVL D SDG S F F L Y - 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 ~ — ■ . — + — + _ + — . — + - — 4- + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
C SKLTVDKS RWQ Q GNVF SC S V - 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 ~ + -4-- + 4- + 4- 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
C MHEALHNHYTQK SLSLS PGK- 

AAGGTGGAGGTGGTGGTGGAGGTACTTACTCTTGCCACTTCGGCCCGCTGACTTGGGTTT 

721 + + + 4- + 780 

TTCCACCTCCACCACCACCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAACCCAAA 
c GGGGGGGTYSC H FGP DTWVC- 

BamHI 
I 

GCAAACCGCAGGGTGGTTAATCTCGTGGATCC 

781 + + - + — 812 

CGTTTGGCGTCCCACCAATTAGAGCACCTAGG 
C K P Q G G * 
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FIGURE 14 

Xbal 



1 4- ■ -4- ■■ + + + 4- 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCCTCCATGAATGAGAACGG 
c MGGTYSCH- 

ACTTCGGCCCGCTGACTTGGGTATGTAAGCCACAAGGGGGTGGGGGAGGCGGGGGGGACA 

61 4- ■ + 4- + + 120 

TGAAGCCGGGCGACTGAACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCCTGT 
c FGPLTWVCKPQ GGGGGGGDK- 

AAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCC 

121 + + -■ + 4- +~ + 180 

TTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGG 
c THTCPPCPAPE L LGG PSVFL- 

TCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCG 

181 ■ 4- + 4- + +— . 4- 240 

AGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGC 
c FPPK PKDTLMI SRTP EVTCV- 

TGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCG 

241 + + , - + ~ -—4- 4- 300 

ACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGC 
c VVDVSHEDPEVKFNWYVDGV- 

TGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTG 

301 : 4- + 4--- : -4- -4- 360 

ACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCAC 
c EVHNAKTKPR E E QYNSTYRV- 

TGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCA 

361 4- ■ — 4- ■■ 4-. : 4- 4—: 4- 420 

ACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGT 
c V SVLTVLHQDW LNGKEYKCK- 

AGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGC 

421 4- : ■ 4- ■ + . - — + ~ 4" - + 480 

TCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCG 
C VSNKALPAPIE K T I SKAKGQ- 

AGC C C C G AGAAC C AC AGGTGT AC AC C CTGC C C C C AT C CCGGG ATG AG CTG ACC AAG AAC C 

481 - — . + 4- 4--- 4- 4--- 4- 540 

TCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGG 
c PREPQVYTLPP SRDELTK NQ- 

AGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGG 

541 -4- 4- -4- 4- 4- 4- 600 

TCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCC 
C VSLTCLVKGFY P SD1 A V E W E - 

AGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACG 

601 — ■ +-> -4- 4--: -4- + + 660 

TCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGC 
c SNGQ PENNYKT T P PVL.DSDG- 

GCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACG 

661 ■ 4- 4- — - + --4- 4- ■■ 4- 720 

CGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGC 

c sfflyskltvdksr wqqgnv- 

TCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCT 

721 + - — + , 4- + + + 780 

AGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGA 
C F SCSVMHEALHNHYTQKSLS- 

BairiHI 

CCCTGTCTCCGGGTAAATAATGGATCC 

781 — 4- 4- 807 

GGGAC AGAGGCC C ATTT ATT ACCT AGG 
c L S P G K * 
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FIGURE 15 

Xbal 

1 

TCTAGATTTGAGTTTTAACTTTTAGAAGGAGGAATAAAATATGGGAGGTACTTACTCTTG 

1 + - + + +■■ -- + -- — + 60 

AGATCTAAACTCAAAATTGAAAATCTTCCTCCTTATTTTATACCCTCCATGAATGAGAAC 

MGGTYSC- 

CCACTTCGGCCCACTGACTTGGGTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGG 

Si -- + - + : + + — +■ + 120 

GGTGAAGCCGGGTGACTGAACCCAAACGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACC 
HFGPLTWVCKP QGGGGGGGG- 

TACCTATTCCTGTCATTTTGGCCCGCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGG 

121 — ■■ ■ — + ■ + + + +-- ■ + 180 

ATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCC 
TYSCHFGPLTWVCKPQGGGG- 

AGGCGGGGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGG 

1 81 : : + + + — + + 240 

TCCGCCCCCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCC 
GGGDKTHTCPPC PAPEIiLGG- 

ACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCC 

241 ■■ + + + + + — + 300 

TGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGG 
PSVFLFPPKPKDTLMISRTP- 

TGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTG 

301 + + + - + +- ■ + 360 

ACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGAC 
EVTCVVVDVSHEDPEVKFNW- 

GTAGGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAA 

361 : . + ■ * + : + + : + 420 

CATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTT 
YVDGVEVHNAKTKPREEQYN- 

CAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAA 

421 + + + + : + + 480 

GTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTT 
STYRVVSVLTVL HQDWLNGK- 

GGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTC 

481 + : ~+ — > + + — : + + 540 

CCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAG 
EYKCKVSNKADPAPIEKTIS- 

CAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGA 

541 ; + + + : : + , + + 600 

GTTTGGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACT 
KAKGQPREPQVYTLPPSRDE- 

GCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACAT 

601 4- ■ +~ + : + : + 660 

CGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTA 
LTKNQVSLTCLVKGFYPSDI- 

CGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGT 

661 - + + ■ + + ■ ^-+- ~ + 720 

GCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCA 
AVEWESNGQPENNY KTTPPV- 

GCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTG 

721 ~~ -J- + + + +— + 780 

CGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCAC 
LDSDGSFFI.YSKLTVDKSRW- 

GCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACAC 

781 + + + - + + - + 840 

CGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTG 
QQGNVFSCSVMHEALHNHYT- 

BamHI 

i 

GCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

841 — — — + --J- — -h- 881 

CGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

b QKSLSLSPGK* 
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FIGURE 16 

Xbal 

1 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATJ^CATATGGACAAAACTCACACATGTC 

1 + _, + - + . + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTCP- 

CACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAAC 

61 + ■■ + - ' + ■ + + — ■■ — = + 120 

GTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTG 
PCPAPELLGGP SVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 + : + . + + : + — ~ — + 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDTLMISRTPEVTCVVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAAGTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 + ■ + -+ + - + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HED P EVKFNWY VDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 — + + + + + — + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYNS TYRVVSVL.T- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 + +■ + + ■ — + ■■ + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
VLHQDWLNGKE YKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + — ■ +~ + + + + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTG 
LPAPIEKTI SKAKGQ P R E P Q - 

AGGTGTACACCCTGCCTCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 +-< + ■ + + + 480 

TCCACATGTGGGACGGAGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTLP PSRDELTKNQVS LTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 ~+ + + -+ + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
LVKGFYPSDIAVEWE SNGQP- 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 _ : „ — + _+ . ~ + + — + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
ENNYKTTP PVLD SDG S F F L Y - 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 : + + : + + : + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTGGTCCCCTTGCAGAAGAGTACGAGGC 
S KLTVDKSRWQ Q GNVFSC S V ^ 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + — : + — + ■■ +-■ + -+ 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQK SLS L S P G K ~ 

AAGGTGG AGGTGGTGGCGG AGGT ACTT ACTCTTGC C ACTT C GGC C C ACTG ACTTGGGTTT 

721 + + - + +- -+ + 780 

TTCCACCTCCACCACCGCCTCCATGAATGAGAACGGTGAAGCCGGGTGACTGAACCCAAA 
GGGGGGGTYSC HFGPLT WVC- 

GCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGC 

781 + + + + + — , , — + 840 

CGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACCATGGATAAGGACAGTAAAACCGGGCG 
KPQGGGGGGGGTYSCHFGPL- 

BamHI 
1 

TGACCTGGGTATGTAAGCCACAAGGGGGTTAATCTCGAGGATCC 

841 + + -i- + 884 

ACTGGACCCATACATTCGGTGTTCCCCCAATTAGAGCTCCTAGG 
TWVCKPQGG* 
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FIGURE 17 A 



[ Aat I J sticky end] 
(position #4358 in pAMG2 1 ) 



5 ' GCGTAACGTATGCATGGTCTCC- 
3 ' TGCACGCATTGCATACGTACCAGAGG- 



- CCATGCGAGAGTAGGGAACTGCC AGGCATC AAATAAAACGAAAGGCTC AGTCGAAAGACT - 
-GGTACGCTCTCATCCCTTGACGGTCCGTAGTTTATTTTGCTTTCCGAGTCAGCTTTCTGA- 

-GGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGC- 
-CCCGGAAAGCAAAATAGACAACAAACAGCCACTTGCGAGAGGACTCATCCTGTTTAGGCG- 

-CGGGAGCGGATTTGAACGTTGCGAAGCAACGGCCCGGAGGGTGGCGGGCAGGACGCCCGC- 
-GCCCTCGCCTAAACTTGCAACGCTTCGTTGCCGGGCCTCCCACCGCCCGTCCTGCGGGCG- 

-CATAAACTGCCAGGCATCAAATTAAGCAGAAGGCCATCCTGACGGATGGCCTTTTTGCGT- 
-GTATTTGACGGTCCGTAGTTTAATTCGTCTTCCGGTAGGACTGCCTACCGGAAAAACGCA- 

AatI I 

-TTCTACAAACTCTTTTGTTTATTTTTCTAAATACATTCAAATATGGACGTCGTACTTAAC- 
-AAGATGTTTGAGAAAACAAATAAAAAGATTTATGTAAGTTTATACCTGCAGCATGAATTG- 

-TTTTAAAGTATGGGCAATCAATTGCTCCTGTTAAAATTGCTTTAGAAATACTTTGGCAGC- 
-AAAATTTCATACCCGTTAGTTAACGAGGACAATTTTAACGAAATCTTTATGAAACCGTCG- 

-GGTTTGTTGTATTGAGTTTCATTTGCGCATTGGTTAAATGGAAAGTGACCGTGCGCTTAC- 
-CCAAACAACATAACTCAAAGTAAACGCGTAACCAATTTACCTTTCACTGGCACGCGAATG- 

-TACAGCCTAATATTTTTGAAATATCCCAAGAGCTTTTTCCTTCGCATGCCCACGCTAAAC- 
-ATGTCGGATTATAAAAACTTTATAGGGTTCTCGAAAAAGGAAGCGTACGGGTGCGATTTG- 

-ATTCTTTTTCTCTTTTGGTTAAATCGTTGTTTGATTTATTATTTGCTATATTTATTTTTC- 

- T AAGAAAAAG AGAAAAC C AAT T TAGC AAC AAAC T AAAT AAT AAACGATAT AAATAAAAAG - 

- G AT AATT AT C AAC T AGAG AAGGAAC AAT T AATGGT ATGT T C AT AC ACGC ATGT AAAAAT A- 
-CTATTAATAGTTGATCTCTTCCTTGTTAATTACCATACAAGTATGTGCGTACATTTTTAT- 

-AACTATCTATATAGTTGTCTTTCTCTGAATGTGCAAAACTAAGCATTCCGAAGCCATTAT- 

- TTGAT AGATAT ATC AAC AGAAAGAGAC T TAC AC GT TT TGAT T C GTAAGGC T T C GGT AAT A- 

- TAG C AGTATGAATAGGGAAAC TAAAC C C AGTGAT AAG AC C TGATGAT TTCGCTTCTT TAA- 
-ATCGTCATACTTATCCCTTTGATTTGGGTCACTATTCTGGACTACTAAAGCGAAGAAATT- 

-TTACATTTGGAGATTTTTTATTTACAGCATTGTTTTCAAATATATTCCAATTAATCGGTG- 
-AATGTAAACCTCTAAAAAATAAATGTCGTAACAAAAGTTTATATAAGGTTAATTAGCCAC- 

- AATGATTGGAGTTAGAATAATC TAC TATAGGATC ATATTTTATTAAATT AGCGTC ATC AT - 

- TTACTAACC T C AAT C T T ATTAGATGAT AT C C T AGTAT AAAATAAT T T AAT CGC AGT AGT A - 

-AATATTGCCTCCATTTTTTAGGGTAATTATCCAGAATTGAAATATCAGATTTAACCATAG- 
-TTATAACGGAGGTAAAAAATCCCATTAATAGGTCTTAACTTTATAGTCTAAATTGGTATC- 

-AATGAGGATAAATGATCGCGAGTAAATAATATTCACAATGTACCATTTTAGTCATATCAG- 
-TTACTCCTATTTACTAGCGCTCATTTATTATAAGTGTTACATGGTAAAATCAGTATAGTC- 

-ATAAGCATTGATTAATATCATTATTGCTTCTACAGGCTTTAATTTTATTAATTATTCTGT- 

- TATTC G T AAC T AATTAT AGTAATAACGAAGATG T C CGAAAT TAAAATAATTAATAAGAC A - 

-AAGTGTCGTCGGCATTTATGTCTTTCATACCCATCTCTTTATCCTTACCTATTGTTTGTC- 

- TTC ACAGCAGCCGTAAATACAGAAAGTATGGGTAGAGAAATAGGAATGGATAAC AAACAG - 

-GCAAGTTTTGCGTGTTATATATCATTAAAACGGTAATAGATTGACATTTGATTCTAATAA- 
-CGTTCAAAACGCACAATATATAGTAATTTTGCCATTATCTAACTGTAAACTAAGATTATT- 
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FIGURE 17B 



-ATTGGATTTTTGTCACACTATTATATCGCTTGAAATACAATTGTTTAACATAAGTACCTG- 
-TAACCTAAAAACAGTGTGATAATATAGCGAACTTTATGTTAACAAATTGTATTCATGGAC- 

- TAGGATC GT AC AGGTTTAC GC AAGAAAATGGTTTGTTATAGTC GATTAATCGATTTGATT - 
-ATCCTAGCATGTCCAAATGCGTTCTTTTACCAAACAATATCAGCTAATTAGCTAAACTAA- 

- C TAGATTTGTTTTAAC TAATTAAAGGAGGAAT AAC ATATGGTTAAC GC GTTGGAATTC GA- 
-GATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGCT- 

SacII 

-GCTCACTAGTGTCGACCTGCAGGGTACCATGGAAGCTTACTCGAGGATCCGCGGAAAGAA- 
-CGAGTGATCACAGCTGGACGTCCCATGGTACCTTCGAATGAGCTCCTAGGCGCCTTTCTT- 

- GAAG AAGAAGAAGAAAGC C C GAAAGGAAGC TGAGTT GGC TGC T GC C AC C GC T GAGC AATA- 
-CTTCTTCTTCTTCTTTCGGGCTTTCCTTCGACTCAACCGACGACGGTGGCGACTCGTTAT- 

-ACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGG- 

- TGAT C GT ATTGGGGAAC C C C GG AGATTTGC C C AGAAC TC CC C AAAAAAC GAC TTTC C T C C - 

-AACCGCTCTTCACGCTCTTCACGC 3' [SacII sticky end] 

-TTGGC GAGAAGTGCGAGAAGTG 5' (position #5904 in pAMG21) 
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FIGURE 18A 



Erythroid parameters EMP-Fc, single bolus injection. 
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FIGURE 18B 



Normal female BDF1 mice treated with 100ug/kg EMP-Fc 
in 7-day micro osmotic pumps 
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FIGURE 19A 



Ndel 



CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

X + + 4- + + -4- 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTC PPCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + 4- - + + 4- 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMI SRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + — + + + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + ■ • — 4- + 4- + --■ + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 — + -4- — + 4- + -■ 4- 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVL TVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + 4- — + + + — — 4- 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTI SK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 4- + + + - + — + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 — - + — + 4- + 4- — 4- 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 4-- + -4- 4- + + 540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWES NGQP ENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 4- + + 4- + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 
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FIGURE 19B 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + — + — + + - + + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVF S C S VMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTGACTTCCTGCCGCACTAC 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCACTGAAGGACGGCGTGATG 

KSLSLS PGKGGGGGDFL PHY 

BamHI 
I 

AAAAACACCTCTCTGGGTCACCGTCCGTAATGGATCC 

721 + • + — 757 

TTTTTGTGGAG AGAC C C AGTGGC AGGC ATTAC C TAGG 

KNTSLGHRP* 
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FIGURE 20 A 

Ndel 
I 

CATATGGACTTCCTGCCGCACTACAAAAACACCTCTCTGGGTCACCGTCCGGGTGGAGGC 

1 + + + __ + _ + + 60 

GTAT AC CTGAAGGAC GGC GTGATGTTTTTGTGGAGAGAC C C AGTGGC AGGC C C AC C TC C G 

MDFLPHYKNTSLGHRPGGG 

GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG 

61 4- — + ■ + +— — +-■ + 120 

CCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGC 

GGDKTHTC PPCPAPELL GGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

121 — + — — + + — - + — + + 180 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMI SRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

181 + + + + + + 240 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGAC GGC GTGGAGGTGC ATAATGC C AAGAC AAAGC C GC GGGAGGAGC AGTAC AAC AGC 

241 — — + + + — — + - + + 300 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

301 — + -4- + +-- + — + 360 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVL TVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

361 : + + + + - + + 420 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 
421 + . + — + — + + + 480 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQ P RE P QVYTLP P S RD EL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

481 + + + + — + -+ 540 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLrTCLVKGFYPS DIA 

GTGGAGTGGGAGAGC AATGGGC AGC CGGAGAAC AAC TAC AAGAC C AC GC C TCC C GTGC TG 

541 + + + + — + + 600 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWE SNGQ PENNYKTTP PVL 
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FIGURE 20B 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

601 + + - + + — + + 660 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

661 -■ + + + + + + 720 

GTCCCCTTGCAGAAGAGTACGAGGCAGTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFS C SVMHEALHNHYTQ 



BamHI 

AAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCCGCGG 

721 + + + - + - 761 

TTC TCGGAGAGGGAC AGAGGC C C ATTTATTAC C TAGGC GC C 

KSLSLSPGK* 
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FIGURE 21A 

Ndel 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

1 — + + — + + ■ 4- 4- 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCP PCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + 4- + - + + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 — + — + + + --■ -4- — + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 ■+ — 4-- 4- — 4- 4- 4- 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 -4- 4- 4- + -= 4-- — + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVS VLTVLHQDWLNG KE 

TAC AAGTGC AAGGT C TC C AAC AAAGC C C T C C C AGC C C C CAT C G AGAAAAC C ATC T C C AAA 

301 — — — + -4- ~ 4- 4- — r - + - + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSKTKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 — + — 4- 4- 4- — — + — 4- 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACGTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 4- + 4- — + 4- -4- 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLT CLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + — ■ — - — 4- — --4- 4- 4- 4- 540 

GACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 4- + 4- 4- + — + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCGACCGTC 

DS DGSFFLYSKLTVDKSRWQ 
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FIGURE 21B 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

GTC C C CTT GC AGAAG AGTAC GAGGC AC TAC GTAC TCCG AGAC GTGTTGGTGATGTGC GTC 

QGNVF SC SVMHEAL HNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTTTCGAATGGACCCCGGGT 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCAAAGCTTACCTGGGGCCCA 

KSLSLSPGKGGGGGFEWTPG 

BamHI 
I 

TACTGGCAGCCGTACGCTCTGCCGCTGTAATGGATCCCTCGAG 

721 + - — + — +■ + 763 

ATGACC GTCGGC ATGC GAGAC GGC GAC ATTAC C T AGGGAGC TC 

Y.WQPYALPL* 
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FIGURE 22A 

Ndel 
I 

CATATGTTCGAATGGACCCCGGGTTACTGGCAGCCGTACGCTCTGCCGCTGGGTGGAGGC 

1 — + - — + + + + + 60 

GTATACAAGCTTACCTGGGGCCCAATGACCGTCGGCATGCGAGACGGCGACCCACCTCCG 

MFEWTPGYWQPYALPLGGG 

GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG 
61 — _ + + 4- + — +- — + 120 

CCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGC 

GGDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

121 + - + + -■ + + + 180 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLF PPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

181 + — + — — 4- 4- + + 240 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

241 + + + + + + 300 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

301 - + - + — + + + + 360 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TAC AAGTG C AAGGTC TC C AAC AAAGC C C TC C C AGC C C C CATC G AGAAAAC CATC TC C AAA 

361 + + + + -■ + — + 420 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSHKALPA PIEKTISK 

GCC AAAGGGC AGC CCC GAGAACC AC AGGTGTAC ACC CTGC C C C C ATC C CGGGATGAGCTG 

421 + + -= ■ + — + + + 480 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKG Q PRE PQVYTLP PSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

481 + + - + — + -f + 540 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLT.CLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

541 + + — + — + + -f 600 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 
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FIGURE 22B 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

601 + — + ~- — + + + + 660 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCAGTACACGCAG 

661 + +— + + - + + 720 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

BamHI 
I 

AAGAGC CTC TC C C TGTC TC C GGGTAAATAATGGATC C 

721 - + — + — ' — + 757 

TTCTCGGAGAGGGAC AGAGGC C C ATTTATTACC TAGG 

KSLSLSPGK* 
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FIGURE 23A 

Ndel 

I 

CATATGGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCG 

1 + - — + + ■ — — + + - + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 
61 + + . + — + + + 120 

AGTC AAAAGGAGAAGGGGGGTTTTGGGTTC C TGTGGGAGTAC TAGAGGGC C TGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 — . + + + + — + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGAC GGC GTGGAGGTGC ATAATGC C AAGACAAAGC C GC GGGAGGAGC AGTAC AAC AGC 

181 + ~ 4- + — ■ + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKP REEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 -+ + - + + + — : — + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + - + ~ + -+ — + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKC KVSNKALPAP IEKTISK 

GC C AAAGGG C AGC C C C G AGAAC C AC AGGTGTAC AC CCTGCCCC C ATC C C GGGATGAGC TG 

361 --- + + „„„_ + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQP REPQVYTLPPSRDEL, 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + + + — — + + 540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNY.KTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + -~ + -- + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFF L YSKLTVDKSRWQ 
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FIGURE 23B 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGGAG 

601 + + ■ + + -1-- + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

Q GNVF SCSVMH EALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGTGGTGGTGGTGTTGAACCGAACTGTGAC 

661 + + +~ + - + + 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCACCACCACCACAACTTGGCTTGACACTG 

KSLSLSPGKGGGGGVEPNCD 

BamHI 

ATC C ATGTT ATGTGGGAATGGGAATGTTTTGAAC GTCTGTAAC TC GAGGATC C 

721 + + - — + — -i- — — + — - 773 

TAGGTAC AATAC AC C CTTACC C TTAC AAAAC TTGC AGAC ATTGAGCTC CTAGG 

IHVMWEWECFERL* 
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FIGURE 24A 

Ndel 

CATATGGTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGT 

1 + + + + + + 60 

GTATACCAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCA 

MVEPNCDIHVMWEWECFER 

CTGGGTGGTGGTGGTGGTGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTC 
61 + + . + - + + + 120 

GACCCACCACCACCACCACTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAG 

LGGGGGDKTHTCPPCPAPEL. 

CTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCC 

121 + — + + + . + -+ 180 

GAC C C C CC T GGC AGTC AAAAGGAGAAGGGGGGTTTTGGGTT C C TGTGGGAGTACTAGAGG 

L GGPSVFLFPP KPKDTLMIS 

CGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAG 

181 + + + + + + 240 

GCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTC 

RTPEVTCVVVDVSHEDPEVK 

TTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAG 

241 + - + + - + -- = + — + 300 

AAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTC 

FNWYVDGVEVHNAKTKPREE 

CAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTG 

301 - + + --■ + + + ■ + 360 

GTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGAC 

QYNSTYRVVSVLTVLHQDWL 

AATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAA 

361 + ■ + — + + — + — + 420 

TTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTT 

NGKEYKCKVSNKALPAPIEK 

ACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCC 

421 + + - + — + + + 480 

TGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGG 

T I S KAKGQ P RE P QVYTL P P S 

CGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCC 
481 — + + + + _ + + 540 

GCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGG 

RDELTKNQVSLTCL VKGFYP 

AGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACG 

541 + + + - — + + + 600 

TCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGC 

S D IAVEWE SNG Q PENNYKTT 
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FIGURE 24B 

CCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAG 

601 + --■ + + + - + + 660 

GGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTC 

P PVLDSDGS FFLYSKLTVDK 

AGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAAC 

661 — + - + -+ — + + + 720 

TCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTG 

S RWQQGNVF S C SVMHEALHN 

BamHI 
i 

CACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAACTCGAGGATCC 

721 + — + — — + + + 773 

GTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTGAGCTCCTAGG 

HYTQKSLSLSPGK* 
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FIGURE 25A 

Ndel 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 
1 -f + + — + — + + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCP PCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

SI - — + + + + + -- + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 — — + + + — + — + -+ 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGAC GGC GTGGAGGTGC ATAATGC C AAGAC AAAGCCGC GGGAGGAGC AGTAC AAC AGC 

181 + - + — - + -- — — + + + 240 

C AG C TGC C GC AC C TCC AC GTATT AC GGTTC TGTTTC GGC GC C C TC CTC GTC ATGT TGTC G 

VDGVEVHNAKTKPREEQYHS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + - + + - + + --+ 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + + -4- + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKAL PAPIEKTIS K 

GC C AAAGGGC AGC C C C GAG AAC C AC AGGTGTAC AC C C TGC C C C C ATC C C GGG ATG AGCTG 

361 +--■ + + + — — + - + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 -+ + ~ — + -f +- — + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFY PSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 — — + + + — + + + 540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPEN-NYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 
541 + -= + + — - + ■ — + — + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 
DSDGSFFLYSKLTVDKSRWQ 
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FIGURE 25B 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 — + + — + + + + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSC SVMHEALHKFHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTTGCACCACCCACTGGGGT 

661 + - + + - -+ + — + 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCAACGTGGTGGGTGACCCCA 

KSLSLSPGKGGGGGCTTHWG 

BamHI 
I 

TTCACCCTGTGCTAATGGATCCCTCGAG 

721 — + — + - 748 

AAGTGGGACACGATTACCTAGGGAGCTC 

F T L C * 
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FIGURE 26A 



Ndel 



CATATGTGCACCACCCACTGGGGTTTCACCCTGTGCGGTGGAGGCGGTGGGGACAAAGGT 
1 + + - + + + + 60 

GTATACACGTGGTGGGTGACCCCAAAGTGGGACACGCCACCTCCGCCACCCCTGTTTCCA 

MCTTHWGFTIiCGGGGGDKG 

GGAGGCGGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGG 

61 + + — + + + — + 120 

CCTCCGCCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCC 

GGGGDKTHTCPPCPAPELLG 

GGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACC 

121 + + + + + - — + 180 

CCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGG 

GPSVFLFP PKPKDTLMI S RT 

CCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAAC 

181 + + 4- + + + 240 

GGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTG 

PEVTCVVVDVSHEDPEVKFN 

TGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTAC 

241 — + + -- + - + + -+ 300 

ACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATG 

WYVDGVEVHNAKTKPREEQY 
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301 + + - + — + + ----+ 360 
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NSTYRVVSVLTVLHQDWLNG 
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361 - + + + -i- + + 420 
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KEYKCKVSNKALPAPIEKTI 

TC C AAAGC C AAAGGGC AGC C C C G AGAAC C AC AGGTGT AC ACC C TGC C C C CAT C C C GGG AT 

421 + + + + + + 480 

AGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTA 

SKAKGQPREPQVYTLPPSRD 

GAGCTGACCAA.GAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGAC 

481 + - + — + + + + 540 

CTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTG 

ELTKNQVSLTCLVKGFYPSD 

ATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCC 

541 — + + -- + + + 600 

TAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGG 

IAVEWESNGQPENNYKTTPP 
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FIGURE 26B 

GTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGG 

601 + + — + + — + + 660 

CACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCC 

VLDSDGSF FDYSKLTVDKSR 

TGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTAC 

661 + +- + + + -i- 720 

ACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATG 

WQQGNVF S C SVMHEALHNHY 

BamHI 

ACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

721 + + + + 763 

TGC GTCTTCTC GGAGAGGGAC AGAGGC C C ATTTATTAC C TAGG 

TQKSLSLSPGK* 
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